PHAGOCYTOSIS GENES AND USES THEREOF



RELATED APPLICATIONS

This application is a continuation-in-part of and claims priority to U.K.
Patent Application No. 9820816.8, filed September 24, 1998, and U.K. Patent
Application No. 9812660.0, filed June 11, 1998; is a continuation-in-part of
and claims priority to U.S. Application No. 09/096,347, filed June 11, 1998, and
U.S. Application No. 09/096,631, filed June 11, 1998; and claims the benefit of
U.S. Provisional Application No. 60/072,324, filed January 23, 1998. The
teachings of all of the referenced applications are incorporated herein by reference
in their entirety.

GOVERNMENT SUPPORT

The invention was supported, in whole or in part, by Grant GM52540 from
the National Institutes of Health. The Government has certain rights in the
invention.

BACKGROUND TO THE INVENTION

Phagocytosis, or engulfment, is a specialized form of endocytosis through
which eukaryotes take up very large particles, or even whole cells. It is a
fundamental biological process conserved from single-cell organisms, such as
amoebae, to mammals (Metchnikoff, E. (1891), Lectures on the Comparative
Pathology of Inflammation; delivered at the Pasteur Institute, 1891, 1968 Edition
(New York: Dover Publications)). Initially used for the dual purpose of feeding
and defence, phagocytosis evolved, following the emergence of mesoderm, into a
mechanism used to protect the host against invading organisms and to clear up
foreign particles and cell debris (Metchnikoff, 1891). Recently, the significance of
phagocytosis has been extended due to its role in eliminating cells undergoing
programmed cell death (apoptosis). Because apoptosis has been implicated in a
number of human diseases, elucidating how this phagocytosis is regulated is
highly desirable, as it may lead to a new route of therapeutic intervention in
these diseases. Accordingly, a need exists to isolate a gene and protein that
regulate phagocytosis. A further need exists for therapeutic treatment for diseases
related to phagocytosis of apoptotic cells.

SUMMARY OF THE INVENTION

Genetic studies in C. elegans have identified over a dozen genes that
function in programmed cell death. The present inventors have used positional
cloning to clone, and have functionally characterized, the C. elegans gene CED-6.
It is shown that the CED-6 protein contains a phosphotyrosine binding domain and
several potential SH3 binding sites. It is further demonstrated that CED-6 acts
within engulfing cells, and functions to promote the removal of both early and
persistent cell corpses. Overexpression of CED-6 can partially suppress the
engulfment defect of both CED-1 and CED-7, suggesting that CED-6 functions
downstream of these two genes. CED-6 acts as an adaptor molecule in a signal
transduction pathway that mediates the engulfment of apoptotic cells in C. elegans.
The present inventors have also identified, isolated and characterized a human
CED-6 homologue, including a splice variant thereof, which is shown to be
involved in a similar process in mammalian cells.

The invention provides, in isolated form, a protein which is the CED-6
protein of C. elegans or a protein which has equivalent function thereto, and human
homologues of the protein, hereinafter referred to as h1CED-6, h2CED-6, and
h3CED-6.

The invention further provides a functional fragment of CED-6, h1CED-6,
h2CED-6 and h3CED-6, for example, a fragment corresponding to the
phosphotyrosine binding domain and/or the proline/serine rich region.

The invention further provides an isolated nucleic acid encoding CED-6
and human homologues of CED-6, as well as nucleic acid encoding functional
fragments of CED-6, h1CED-6, h2CED-6 and h3CED-6 as described above.

The invention further provides nucleic acid which is antisense to any of the
nucleic acids described above or which is capable of hybridizing to any of the
nucleic acids described above under conditions of low, medium or high stringency,
or portions or fragments thereof.

The invention further provides expression vectors comprising nucleic acid
encoding CED-6, h1CED-6, h2CED-6, h3CED-6 or encoding functional fragments
of said proteins as above.

The invention further provides mammalian cell lines transfected with one
or more nucleic acids encoding CED-6, h1CED-6, h2CED-6, and/or h3CED-6.

The invention further provides assay methods using the proteins, nucleic
acids and transfected cells described above to identify compounds which enhance
or inhibit the signal transduction pathway in which CED-6, h1CED-6, h2CED-6,
and/or h3CED-6 participate.

The invention further provides assay methods using the transfected cells
described above to identify compounds which enhance or inhibit the expression of
the CED-6, h1CED-6, h2CED-6 or h3CED-6 genes.

The invention further provides antibodies which react with an epitope of
CED-6, h1CED-6, h2CED-6, and/or h3CED-6.

The invention further provides a method of treating diseases the etiology of
which may be attributed to failure of engulfment of apoptotic or other diseased
cells, such as inflammation, autoimmune disease or cancer, by administering to a
patient one or more of the aforesaid proteins or nucleic acids or compounds which
are enhancers of CED-6, h1CED-6, h2CED-6 or h3CED-6.

The invention further provides a method of treating diseases which would
benefit from a reduction in the engulfment of apoptotic cells, such as
neurodegenerative diseases, stroke, or sickle-cell anaemia, by administering one or
more of the aforesaid proteins, nucleic acids or compounds which are inhibitors of
CED-6, h1CED-6, h2CED-6, or h3CED-6.

The invention further provides a method of diagnosis of a human or animal
disease using a nucleic acid encoding CED-6, h1CED-6, h2CED-6 or h3CED-6 or
the complement thereof, or an antibody to CED-6, h1CED-6, h2CED-6 or
h3CED-6, to detect a genetic defect.

The invention further provides a method of identifying proteins which
interact with CED-6, h1CED-6, h2CED-6 or h3CED-6 in the signal transduction
pathway in which those proteins participate.

The invention further provides a fusion protein in which CED-6, h1CED-6,
h2CED-6 or h3CED-6, or a functional fragment thereof such as the
phosphotyrosine binding domain or serine/proline rich region, is fused to another
protein such as an epitope tag or the product of a reporter gene.

The invention further provides a method of determining whether a
compound is an enhancer or inhibitor of the signal transduction pathway in which
CED-6 participates by observing the effect of the compound on C. elegans worms
having altered CED-6 expression.

BRIEF DESCRIPTION OF THE DRAWINGS

Figures 1A-1E are schematic representations of the CED-6 locus. Figure
1A: Genetic map of CED-6. CED-6 and some nearby genes that were also used to map
CED-6 are shown. Figure 1B: Cosmid rescue. Transgenic animals carrying
cosmids or subcloned DNA fragments (see C, D) were examined for cell corpses
in three-fold embryos. Lines that gave embryos with partial or no cell corpses
were counted as rescuing transgenic lines. Four of the thirteen cosmids tested are
shown. Rescuing fragments are in bold. Numbers represent the number of rescuing
lines / number of lines tested. Figure 1C: Subcloning of the F56D2 cosmid and
rescue. A restriction map of the CED-6 region is shown at the top. In the middle,
several restriction fragments were tested for their ability to rescue the
engulfment defect caused by CED-6(n1813). Figure 1D: Subcloning of the 10 kb
Xho I fragment and rescue. A restriction map of the Xho I fragment is shown at
the top. In the middle, mutations made in the Xho I fragment and their rescuing
ability are shown. An X indicates a frameshift mutation (see Experimental
Procedures for details). Figure 1E: Transcripts on the Xho I fragment.
Intron/exon structure of the transcripts in the Xho I fragment region is shown.
Boxes: exons; V symbol: introns; AAA: poly(A) tail. RT-PCR
products of the 5' end of F56D2.7 contain both SL1 and SL2.

Figures 2A and 2B show that F56D2.7 encodes CED-6. Figure 2A shows
the full-length cDNA (SEQ ID NO: 1) and amino acid sequence (SEQ ID NO: 2) of
C. elegans CED-6. Double underline shows the nucleic acid (SEQ ID NO: 3) and
amino acid sequence (SEQ ID NO: 4) of the phosphotyrosine binding (PTB) domain;
single underline indicates the nucleic acid (SEQ ID NO: 5) and the amino acid
(SEQ ID NO: 6) sequence of the proline/serine rich region. Dashed underline
indicates the charged region. Stars identify the prolines in the PxxP signature
sequence, empty triangles the charged residues within the dashed region. The shaded
box indicates the polyadenylation signal. Both SL1 and SL2 could be added to the
trans-splicing acceptor site. The single base pair deletion identified in
CED-6(n1813) is shown. Figure 2B: Southern blot revealing an RFLP on a 4.1
kb fragment from CED-6(n2095). An Xho I probe identifies an allele-specific RFLP
in CED-6(n2095) that affects a 4.1 kb Hind III fragment containing F56D2.7. At
bottom right, the genomic fragments produced by Hind III digestion of the Xho I
fragment region are shown. At top right, the Xho I fragment and the three genes
covered by this region are shown. Three Hind III fragments, 4.1 kb, 0.4 kb and
9.9 kb, that should light up on the Southern blot are indicated. On the left,
genomic DNA isolated independently from wild-type N2, CED-6(n1813) and
CED-6(n2095) was probed with 32P-labeled Xho I fragment. The n2095 allele showed
loss of the 4.1 kb fragment and an extra 2.1 kb fragment. The 0.4 kb fragments
were not affected in either allele (data on a separate gel, not shown here).

Figures 3A-C show that CED-6 contains a phosphotyrosine binding
domain. Figure 3A shows the alignment of the CED-6 PTB domain (SEQ ID NO: 4) with
other PTB domains. The PTB domain alignment was based on the NMR structure
of the Shc protein. Black boxes indicate identical amino acids shared by >50% of
sequences. Grey boxes indicate similar amino acids shared by >50% of
sequences. For this purpose, the following sets of amino acids are considered
similar: G, A, C, S, T; E, D, Q, N; R, K, H; V, M, L, I; F, Y, W. α indicates the α
helices suggested by the NMR structure of Shc, and β the β sheets. Invariant
residues (found in all sequences shown) are highlighted by a star. Figure 3B
shows the comparison of CED-6 to other PTB domain containing proteins. Proline
rich regions, charged regions next to PTB domains, and other regions are shown. PTB
domains were compared by percentage identity. Figure 3C shows the
evolutionary tree of the PTB domains. The alignment from (A) was displayed using
the SeqLab package in the GCG program, and the evolutionary tree was grown graphically.
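
By way of illustration only, the shading rule described for Figure 3A (identical or similar residues shared by >50% of sequences, using the five similarity sets listed above) can be sketched in Python. This is a minimal sketch and not the software used to prepare the figure; the function names, the handling of gaps, and the toy sequences are assumptions introduced here for illustration.

```python
from collections import Counter

# Similarity sets exactly as listed in the Figure 3A description.
SIMILARITY_SETS = ["GACST", "EDQN", "RKH", "VMLI", "FYW"]
GROUP_OF = {aa: i for i, s in enumerate(SIMILARITY_SETS) for aa in s}


def classify_column(column, threshold=0.5):
    """Classify one alignment column as 'identical', 'similar' or 'none'.

    column: string with one residue per sequence ('-' marks a gap; counting
    gaps in the denominator is an assumption of this sketch).
    """
    n = len(column)
    residues = [c for c in column if c != "-"]
    if not residues:
        return "none"
    # Black box: a single residue shared by more than `threshold` of sequences.
    _, top_count = Counter(residues).most_common(1)[0]
    if top_count / n > threshold:
        return "identical"
    # Grey box: a single similarity set shared by more than `threshold`.
    groups = Counter(GROUP_OF[c] for c in residues if c in GROUP_OF)
    if groups and groups.most_common(1)[0][1] / n > threshold:
        return "similar"
    return "none"


def shade_alignment(seqs):
    """Classify every column of a list of equal-length aligned sequences."""
    return [classify_column("".join(col)) for col in zip(*seqs)]


# Hypothetical toy alignment (not taken from the figures).
toy = ["GAVK", "GSLE", "GTIF", "ACFG"]
print(shade_alignment(toy))
```

In this toy alignment the first column is dominated by a single residue, the second and third columns fall within a single similarity set, and the last column has no majority.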

Figure 4 shows results of the genetic mosaic analysis for CED-6 (table at
bottom) and the cell lineage of C. elegans (top). The descent of both germline
and somatic sheath cells is illustrated. Body wall muscle cells, which were used
to determine the loss of the duplication, are also illustrated. The solid square
indicates the duplication loss in germ cells, and the solid square indicates the
duplication loss in the somatic sheath cells. The black arrow indicates the somatic
sheath cell with the enlarged nucleoli in the distal arm of the anterior gonad. The
white arrow indicates the cell corpses accumulated in the proximal arm of the
anterior gonad.

Figures 5A-D provide results showing that heat-shock overexpression of
CED-6 cDNA rescued the engulfment defect in both soma and germline. Figure
5A shows cell death during embryonic development. The shaded box is a
histogram indicating the number of dying cells every 50 minutes during
embryonic development. The arrows indicate the timing of heat shock and the
timing of observation of the engulfment phenotype. Figure 5B shows that
overexpression of CED-6 cDNA promotes engulfment at both the early and the late
stage of cell death. Transgenic animals carrying the transgene, CED-6 cDNA driven
by a heat shock promoter, were treated with heat before the cell death occurred at
the indicated time. Cell corpses in the head of young L1 larvae were examined. The
animals without the heat treatment were also examined. Other control experiments
included N2, CED-6(n1812) with or without heat treatment, and CED-6(n1812)
carrying a lacZ transgene treated with heat. The solid circles indicate the
experiments with the heat shock after the formation of cell corpses, and the empty
circles the experiments with the heat shock before the cell death took place and
the experiments without heat shock. Figure 5C shows that overexpression of CED-6
cDNA rescues the engulfment defect in the germline. The arrow indicates the timing
of the heat shock, when transgenic animals were at the developmental stage 24 hours
after the L4 molt. Cell corpses were examined at several time points between the
time of heat shock and 60 hours after the heat shock. Figure 5D shows that
overexpression of CED-6 cDNA promotes engulfment many hours after the
formation of the cell corpses in the germline. Adult transgenic animals were treated
with heat as indicated. Cell corpses were examined in one gonad arm 12 hours
after the heat shock. Control experiments including N2 and CED-6(n1813) are
indicated in (C).

Figure 6 presents results showing that overexpression of CED-6 partially
suppresses the engulfment defect of both CED-1 and CED-7 during embryonic
development. CED-6 was overexpressed in the genetic background of three alleles
of both CED-1 and CED-7. The timing of the heat shock and the timing of the
examination of cell corpses are illustrated in Figure 5A. Animals with each genetic
background were treated with heat before the cell death occurred or were left
without heat treatment. Cell corpses were examined in the head of young L1 larvae.
LacZ was also expressed in each genetic background. Each mutant was also treated
with heat shock to examine the effect of heat on the expression of cell corpses.

Figure 7 is a model of the epistatic pathway for the engulfment genes.
Overexpression of CED-6 did not have an obvious effect on cell corpse
expression in CED-2, CED-5 and CED-10 mutants, but did in CED-1 and CED-7
mutants. We propose that CED-6 might act downstream of both CED-1 and CED-7,
and that CED-2, CED-5 and CED-10 either act in a different pathway or act
downstream of CED-6.

Figure 8 is a flow chart illustrating that an Xho I fragment from the F56 cosmid
rescues the CED-6 engulfment defect.

Figures 9A-B are schematics that illustrate that the C05D2.7 construct is
CED-6. Figure 9A shows the restriction map of the Xho I fragment and rescue.
Figure 9B shows the transcripts.

Figure 10 is a bar graph illustrating that the over-expression of CED-6
rescues the engulfment defect of the CED-6 mutant.

Figure 11 contains graphs illustrating that the over-expression of CED-6
rescues the engulfment defect of the CED-6 mutant during embryonic development.

Figure 12 is a bar graph illustrating that CED-6 may also promote the
engulfment of persisting corpses.

Figure 13 shows that CED-6 promotes the engulfment of persistent cell
corpses and probably acts within engulfing cells.

Figure 14 is a schematic that shows that CED-6 may be an adaptor protein
acting in a signal transduction pathway.

Figure 15 shows graphs which indicate that over-expression of CED-6
rescues the engulfment defect in the adult gonad, and CED-6 might act in somatic
sheath cells.

Figure 16 illustrates that over-expression of CED-6 partially suppresses the
engulfment defect of CED-1 mutants.

Figure 17 shows that the over-expression of CED-6 cDNA suppresses the
engulfment defect of CED-7 mutants.

Figure 18 shows the consensus DNA sequence (SEQ ID NO: 7) of h1CED-6
(2416 bp) with start and stop codons in bold and alternatively spliced sequence
underlined.

Figure 19 shows the DNA sequence (SEQ ID NO: 13) of h2CED-6 (alternative
splice) with start and stop codons in bold.

Figure 20 shows the amino acid sequence (SEQ ID NO: 8) of h1CED-6
with the alternatively spliced region underlined.

Figure 21 shows the amino acid sequence (SEQ ID NO: 14) of h2CED-6
(alternative splice).

Figure 22 shows the h1CED-6 cDNA (SEQ ID NO: 7) and h1CED-6 (SEQ ID
NO: 8) amino acid sequence with the PTB domain nucleic acid (SEQ ID NO: 9) and
amino acid (SEQ ID NO: 10) sequences, charged region, and proline/serine rich
nucleic acid (SEQ ID NO: 11) and amino acid (SEQ ID NO: 12) sequences indicated.

Figure 23 shows an alignment of CED-6 and h1CED-6.

Figure 24 shows an alignment of regions of 47.5% and 31.6% identity,
respectively.

Figure 25A shows a Human Multiple Tissue Northern (MTN) Blot, Figure 25B
shows a Human Multiple Tissue Northern (MTN) Blot II, and Figure 25C shows a
Human Cancer Cell Line Multiple Tissue Northern (MTN™) Blot. The expression
pattern of h1CED-6 in normal human tissues and cancer cell lines by Northern
blotting is shown.

Figure 26 is a map of plasmid pGA3015 in which a CED-6 fragment is
cloned as a C-terminal fusion to GFP.

Figure 27 is a map of plasmid pGA3064 with CED-6 cloned as a
C-terminal fusion of GFP.

Figures 28A-28F are a DNA alignment (Genework) of the sequenced hbc3123
EST clone, the PCR fragment I isolated from a cDNA library, and three EST
sequences identified using the PCR fragment. The hbc3123 EST clone was sequenced
and analyzed. The three EST clones were identified by searching
GenBank using the isolated PCR fragment.

Figure 29 shows the amino acid sequence (SEQ ID NO: 16) of the human
h3CED-6, as compared to h1CED-6 (SEQ ID NO: 8).

Figures 30A-B show the nucleic acid sequence (SEQ ID NO: 15) that
encodes human h3CED-6, as compared to h1CED-6 (SEQ ID NO: 7).

Figures 31A-B show that overexpression of h3CED-6 rescues an engulfment
defect. Figure 31A shows that overexpression of hCED-6 rescued the engulfment
defect of CED-6(n1813) embryos. Embryos laid by transgenic mothers were
heat-shocked before the wave of embryonic cell death, and scored for the numbers of
persistent cell corpses in the head of L1 larvae. Each dot represents one animal.
Figure 31B shows that overexpression of hCED-6 rescued the germ cell engulfment
defect of CED-6(n1813) animals. Transgenic animals were heat-shocked 36 hours
after the L4/adult molt, and germ cell corpses were scored 12 hours after heat shock.
The number of animals scored is indicated on the top of each bar.

Figures 32A-J show the nucleic acid sequence comparison among ESTs,
CED-6 and hCED-6. Construction of the 2416 bp consensus sequence
was done using sequence information obtained from ESTs, RACE and colony
hybridization. The sequence was compiled using aal 599394 as template and primers
as indicated in the multiple alignment. Rcc stands for the reverse complement. Both
CED-6 and hCED-6 are indicated above the multiple alignment. pGA101 was
picked up by colony hybridization.



DETAILED DESCRIPTION OF THE INVENTION 

cDNAs encoding the alternative splice h2CED-6 and the additional
sequence required to constitute h3CED-6 from h2CED-6 have been deposited at
the Belgian Coordinated Collections of Microorganisms (BCCM) at Laboratorium
voor Moleculaire Biologie - Plasmidencollectie (LMBP), Universiteit Gent, K.L.
Ledeganckstraat 35, B 9000, Gent, Belgium in accordance with the Budapest
Treaty on 8th June 1998 and have been accorded the Accession Nos. LMBP 3868
and LMBP 3869, respectively.

Primers which will assist in obtaining the relevant inserts from these
deposits are shown in Example 14.

AMINO ACID AND NUCLEOTIDE SEQUENCES

SEQ ID NO. 1    Nucleic acid sequence of C. elegans CED-6 (e.g., Figure 2A)

SEQ ID NO. 2    Amino acid sequence of C. elegans CED-6 (e.g., Figure 2A)

SEQ ID NO. 3    Nucleotide sequence encoding PTB domain of C. elegans CED-6
                (e.g., Figure 2A)

SEQ ID NO. 4    Amino acid sequence of PTB domain of C. elegans CED-6
                (e.g., Figure 2A)

SEQ ID NO. 5    Nucleotide sequence encoding proline/serine rich region of
                C. elegans CED-6 (e.g., Figure 2A)

SEQ ID NO. 6    Amino acid sequence of proline/serine rich region of
                C. elegans CED-6 (e.g., Figure 2A)

SEQ ID NO. 7    Nucleotide sequence that encodes h1CED-6 (e.g., Figure 22,
                Figure 18)

SEQ ID NO. 8    Amino acid sequence of h1CED-6 (e.g., Figure 20 and Figure 22)

SEQ ID NO. 9    Nucleotide sequence encoding PTB domain of h1CED-6
                (e.g., Figure 22)

SEQ ID NO. 10   Amino acid sequence of PTB domain of h1CED-6
                (e.g., Figure 22)

SEQ ID NO. 11   Nucleic acid sequence that encodes the proline/serine rich
                region of h1CED-6 (e.g., Figure 22)

SEQ ID NO. 12   Amino acid sequence of the proline/serine rich region of
                h1CED-6 (e.g., Figure 22)

SEQ ID NO. 13   Nucleotide sequence that encodes h2CED-6 (e.g., Figure 19)

SEQ ID NO. 14   Amino acid sequence of h2CED-6 (e.g., Figure 21)

SEQ ID NO. 15   Nucleotide sequence encoding h3CED-6 (e.g., Figures 30A-B)

SEQ ID NO. 16   Amino acid sequence of h3CED-6 (e.g., Figure 29)

C. ELEGANS CED-6

Programmed cell death has traditionally been divided into two distinct,
sequential processes: cell killing, and the removal of dead cells. However, these
two events are very closely linked. In vivo, cells that present an apoptotic
morphology are usually already engulfed by other cells (Wyllie A.H. et al. (1980)
Int. Rev. Cytol. pp 251-306; Lockshin R.A. (1981) Cell Death in Biology and
Pathology, R.A. Lockshin and I.D. Bowen, eds. (London: Chapman and Hall),
pp 79-122; Duvall and Wyllie (1986) Immunol. Today 7, pp 115-119; Robertson
and Thomson (1982) J. Embryol. Exp. Morph. 67, pp 89-100; Hedgecock et al.
(1983) Science 222, 1277-1279; Ellis et al. (1991) Genetics 129, pp 79-94).
Engulfment is also a swift and efficient process in the nematode Caenorhabditis
elegans: dying cells are engulfed and completely removed by their neighboring
cells within an hour (Sulston and Horvitz (1977) Dev. Biol. 56, pp 110-156;
Robertson and Thomson, 1982). The engulfment is not necessarily by professional
phagocytes. Rapid engulfment of apoptotic cells is important, as it prevents dying
cells from releasing potentially harmful contents during their lysis, which could
damage surrounding tissue and result in an inflammatory response (Duvall et al.
(1985) Immunology 56, pp 351-358; Savill et al. (1989) J. Clin. Invest. 83, pp
865-875; Grigg et al. (1991) Lancet, pp 720-722; Savill et al. (1993)
Immunol. Today, pp 131-136).

The nematode C. elegans has been used extensively for the study of
programmed cell death (reviewed by Hengartner (1997) Cell Death, in C. elegans
II, Plainview, Cold Spring Harbor Laboratory Press, pp 383-415). Genetic
studies have identified over a dozen genes that function in the regulation and
execution of apoptosis in C. elegans. Six genes - CED-1, CED-2, CED-5, CED-6,
CED-7, and CED-10 - function in the engulfment of all dying cells (Hedgecock et
al., 1983; Ellis et al., 1991; Horvitz et al. (1994) Cold Spring Harbor Symp.
Quant. Biol. 52, pp 377-385). In animals mutant for any one of these genes,
many apoptotic cells fail to be engulfed and persist for many hours as highly
refractile disks that can be readily identified under differential interference contrast
(DIC) optics (Hedgecock et al., 1983; Ellis et al., 1991). None of the six
engulfment genes is absolutely essential for engulfment, as many dying cells are
still properly removed in these mutants. Genetic analysis of various double
mutants has suggested that these six genes might form two partially redundant
groups, one comprising CED-1, CED-6, and CED-7; the other CED-2,
CED-5, and CED-10 (Ellis et al., 1991). The number of persistent cell corpses is
increased dramatically in double mutants crossing groups, but not in those within
the same group. Understanding how these genes are involved in regulating
engulfment requires the elucidation of their molecular nature.

In other species, several candidate apoptotic receptors have been identified
over the past few years; these include the ATP-binding cassette transporter ABC1
(Luciani and Chimini (1996) EMBO J. 15, pp 226-235); adhesion molecules such
as the vitronectin receptor (Savill et al. (1990) Nature 343, pp 170-173) and CD36
(Asch et al. (1987) J. Clin. Invest. 72, pp 1054-1061; Savill et al. (1992) J. Clin.
Invest. 212, pp 1513-1522; Ren et al. (1995) J. Exp. Med. 181, pp 1857-1862);
Drosophila croquemort (Franc et al. (1996) Immunity 4, pp 431-443); class A
scavenger receptors (Platt et al. (1996) Proc. Natl. Acad. Sci. USA 22, pp
12456-12460); lectins (Duvall et al., 1985); and a predicted receptor that can
recognize phosphatidylserine on the outer leaflet of apoptotic cells (Fadok et al.
(1992) J. Immunol. 148, pp 2207-2216; Fadok et al. (1992) J. Immunol. 142, pp
4029-4035). Currently little is known about the molecules used by engulfing cells
to transduce signals from surface receptors to the cytoskeleton, or how these
molecules regulate the local cytoplasmic rearrangements and dynamic extensions
that are required for phagocytosis (Savill et al., 1993). A genetic analysis of
engulfment in C. elegans could identify genes involved in these processes. Indeed,
Wu and Horvitz (1998) (Nature 222, pp 501-504) showed that C. elegans CED-5 is
homologous to human DOCK180, and might regulate cytoskeleton rearrangement
during engulfment.

The process of apoptosis has been implicated in the etiology, or associated
with the pathology, of a wide range of diseases, including cancer, autoimmune
diseases, various neurodegenerative diseases such as Amyotrophic Lateral
Sclerosis, Huntington's Disease, and Alzheimer's Disease, stroke, myocardial
infarction, and AIDS (Thompson (1995) Science 267, pp 1456-1462). Thus, a
better understanding of the molecular events that underlie apoptosis might lead to
novel therapeutic interventions. While much of the current attention is centered on
the genes and proteins that control the killing step of the death process, it is very
likely that the removal of apoptotic cells will also prove crucial for the proper
overall functioning of the apoptotic program, and will offer another entry point for
therapeutic intervention (as described herein).

The process of recognition and engulfment of dying cells is extremely swift
and efficient. In animals, it is essentially impossible to find a cell with apoptotic
features that is not already within another cell. Such rapid recognition and
phagocytosis of apoptotic cells is a crucial aspect of programmed cell death in
vivo: unengulfed apoptotic bodies can undergo secondary necrosis, leading to
inflammation. Failure to remove apoptotic bodies also exposes the body to novel
epitopes (from e.g., caspase-generated protein fragments), possibly encouraging
the development of autoimmune disease. Persistent apoptotic bodies can often be
observed following chemotherapeutic intervention (which leads to extensive
apoptosis) and are particularly abundant in solid tumors, in which clearance of cell
corpses might be delayed.

In addition to their ability to recognize and engulf apoptotic cells,
professional phagocytes carry specific surface receptors, such as the Fc (Ravetch
(1994) Cell 78, pp 553-560; Greenberg et al. (1993) J. Exp. Med. 177, pp 529-534)
and C3 (Bianco et al. (1975) J. Exp. Med. 141, pp 1278-1290; Greenberg (1995)
Trends in Cell Biol. 5, pp 93-99) receptors, which recognize antigen-opsonized
particles and trigger their phagocytosis. Inhibitor studies have shown that Fc
receptor-mediated phagocytosis requires tyrosine phosphorylation (Greenberg et
al., 1993; Greenberg, 1995). The work of the present inventors suggests that the
engulfment of apoptotic cells could also be mediated by a tyrosine kinase signal
transduction pathway. While these two pathways clearly use distinct receptors at
the cell surface, they must eventually converge on the same downstream
engulfment machinery, and could thus share at least some common signal
transduction molecules.

The invention relates to an isolated protein which is an adaptor molecule in 
a signal transduction pathway which regulates phagocytosis of apoptotic cells. 
In a particular embodiment, the invention pertains to an isolated protein
from the nematode worm C. elegans which is an adaptor molecule acting in a
signal transduction pathway which promotes phagocytosis of apoptotic cells,
which protein comprises the amino acid sequence shown in Figure 2A (SEQ ID
NO: 2) or an amino acid sequence which differs from Figure 2A only in
conservative amino acid changes. As aforesaid, the amino acid sequence shown in
Figure 2A is that of the C. elegans CED-6 protein, with its encoding DNA also
shown.

In another aspect, the invention comprises a nucleic acid comprising
a sequence of nucleotides which encodes the amino acid sequence of Figure 2A
(SEQ ID NO: 2), for example, a sequence of nucleotides from about nucleotide
position 22 to about nucleotide position 1500 of Figure 2A or the entire sequence
of nucleotides shown in Figure 2A.

In a further embodiment of the invention there is provided an isolated
protein which is a fragment or portion of a protein having the amino acid sequence
of Figure 2A or of a protein having an amino acid sequence which differs from
that shown in Figure 2A only in conservative amino acid changes. For example,
the portion may comprise an amino acid sequence corresponding to the
phosphotyrosine binding domain (SEQ ID NO: 4) (about amino acid 46 to about
amino acid 193 in Figure 2A) or an amino acid sequence corresponding to the
proline/serine rich region (SEQ ID NO: 6) (about amino acid 242 to about amino
acid 339 in Figure 2A).
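
Purely as an illustration of how the approximate residue ranges given above could be applied to a sequence, the following Python sketch extracts a region using 1-based, inclusive coordinates. The helper name `region`, the constant names and the placeholder sequence are assumptions introduced here; the actual sequences are those of Figure 2A, which are not reproduced in this sketch, and the stated positions are described in the text only as "about" those coordinates.

```python
def region(seq, start, end):
    """Return positions start..end of seq, 1-based and inclusive."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]


# Approximate amino acid ranges as stated in the text for Figure 2A.
PTB_DOMAIN = (46, 193)
PRO_SER_RICH = (242, 339)

# Hypothetical placeholder standing in for the CED-6 amino acid sequence;
# its length (400) is arbitrary and chosen only so the ranges fit.
placeholder = "M" * 400

ptb = region(placeholder, *PTB_DOMAIN)
pro_ser = region(placeholder, *PRO_SER_RICH)
print(len(ptb), len(pro_ser))
```

With inclusive coordinates, the two ranges span 148 and 98 residues respectively.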

Nucleic acids (SEQ ID NOs: 3 and 5, respectively) encoding the PTB
domain or the proline/serine rich region of the C. elegans CED-6 protein are
encompassed by the claimed invention.



WO 99/37770

PCT/US99/01361

In yet a further aspect of the invention there is provided an isolated nucleic
acid capable of hybridizing to the sequence of nucleotides of SEQ ID NOs: 1, 3, 5,
7, 9, 11, 13, 15 under conditions of low, medium or high stringency. It is to be
understood that low stringency means approximately: 0.2 to 2x SSC; 0.1% SDS;
25° to 50°C.

In a further embodiment of the invention there is provided a fusion protein
which comprises as part of the fusion a protein having an amino acid sequence of
SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, or 16 or an amino acid sequence which differs from
the amino acid sequence shown in SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, or 16 only in
conservative amino acid changes. The protein may be fused to, for example, an
epitope tag or the expression product of a reporter gene.

In yet a further aspect, the invention provides expression vectors comprising
any of the nucleic acid sequences of SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15.
Preferably, the vectors incorporate a reporter gene, such as green fluorescent
protein, which is positioned relative to the nucleic acid of the invention such that
expression of the nucleic acid results in expression of the reporter gene.
Preferably, a fusion of CED-6 and the reporter gene is expressed.

It is to be understood that the term "nucleic acid" as used herein may
include genomic DNA, RNA and cDNA.

Positional cloning methods were used to clone the C. elegans CED-6 gene
and determine the nucleotide sequence. In addition the inventors have functionally
characterized the protein. By searching publicly available protein sequence
databases, it has been determined that the CED-6 protein has in the N-terminal half
a putative phosphotyrosine binding domain and in the C-terminal half a
proline/serine rich region which is a potential SH3 binding domain.

Genetic mosaic analysis, as well as rescue and over-expression
experiments, have shown that CED-6 acts autonomously within engulfing cells
and promotes engulfment of apoptotic cells. Further database searching has
confirmed the functional regions to be surprisingly evolutionarily conserved. Thus,
the inventors have now cloned two human homologues of the C. elegans CED-6
gene and shown them to have equivalent function.

Molecular Cloning of C. elegans CED-6

Previous genetic mapping experiments by Ellis and colleagues (Ellis et al,
(1991) Genetics 129 pp 79-94) have placed the CED-6 gene close to the daf-4 locus
on chromosome three (Figure 1A). The region around daf-4 has been mostly
sequenced by the C. elegans genome sequence consortium (Wilson et al, (1994)
Nature 368 pp 32-38). To determine the exact physical location of CED-6, the
present inventors collected thirteen overlapping cosmids in this region which
together are roughly 0.3 Mbp. Using the germline transformation method (Mello
and Fire, (1995), Methods in Cell Biology (San Diego: Academic Press) pp 452-482)
these cosmids were tested for their ability to rescue the engulfment defect of
CED-6(n1813), by scoring three-fold embryos laid by transgenic animals for the
presence of persistent cell corpses. Three-fold embryos were chosen for the initial
study because cell corpses are numerous and easily seen at this stage of
development. Two overlapping cosmids, F56D2 and F43F12, were found to be
able to rescue the engulfment defect of CED-6(n1813). Further rescuing
experiments using DNA fragments from F56D2 were performed to identify the
region containing the rescuing activity.

The gene prediction program GENEFINDER™ suggested that this region
contains two genes, which the C. elegans genome sequence consortium submitted
to Genbank under the names F56D2.7 and C05D2.6. Using a combination of RT-PCR
and screening of cDNA libraries (see below) the existence and predicted
intron/exon pattern of F56D2.7 was confirmed. However, the inventors found that
C05D2.6, rather than corresponding to a single gene, actually corresponds to two
genes, and the short distance (»>bp) between the end of the upstream transcript
and the start of the downstream transcript suggested that C05D2.6A/B might be a
two-gene operon (Zorio et al (1994) Nature 372 pp 270-272). It was found that
C05D2.6B is trans-spliced to the "downstream" splice leader SL2, whereas the
upstream transcript C05D2.6A is trans-spliced to the more common SL1 splice
leader (Figure 1E).

The CED-6 Locus 

To determine which one of the three genes present on the Xho I fragment
corresponds to CED-6, a number of constructs were generated containing internal
deletions or point mutations. The deletion of most of the C05D2.6A/B operon had
no deleterious effect on CED-6 rescue, whereas the introduction of a frameshift
mutation within exon 3 of F56D2.7 abolished the fragment's rescuing activity
(Figure 1E). To exclude the possibility that F56D2.7 might be a multicopy
suppressor of CED-6, and to confirm suspicions that F56D2.7 might correspond to
the CED-6 locus, the two known CED-6 alleles, n1813 and n2095, were analysed
for any nucleotide changes within this region. Southern blot analysis revealed an
allele-specific restriction fragment length polymorphism affecting F56D2.7 in
CED-6(n2095) mutants (Figure 2A). Based on the hybridization patterns observed
in n2095, a single nucleotide deletion in exon 4 of F56D2.7 in CED-6(n1813) was
also identified. This mutation should result in a reading frame shift and a
truncated protein (Figure 2B). Taken together, the genomic rescue and mutation
data strongly suggested that F56D2.7 corresponded to CED-6.

Identification of CED-6 Transcripts

To confirm the predicted intron/exon structure for CED-6, the present
inventors screened a mixed-stage cDNA library and identified 10 clones
corresponding to the CED-6 gene. Several of these contained splice leader SL2
sequences at the 5' end, suggesting that CED-6 might also be a downstream gene
in an operon. RT-PCR was performed on mixed-stage RNA using both SL1 and
SL2 trans-splicing leaders as primers for the PCR step. Interestingly, sequence
analysis of the PCR-amplified fragments revealed that both SL1 and SL2
trans-splicing leaders can be found at the 5' end of CED-6 transcripts (Figure 23). The
upstream gene in the CED-6 operon is the predicted gene F56D2.1. The presence
of SL1 trans-spliced mRNA suggests that CED-6 might also be transcribed from a
second downstream promoter, independently of the upstream gene. The existence
of a downstream promoter could explain why the Xho I fragment could rescue
CED-6 mutants even though it does not contain the whole CED-6 operon.

CED-6 Protein Contains a Phosphotyrosine Binding (PTB) Domain and a
Proline/Serine Rich Region

The full-length CED-6 cDNA is predicted to code for a 492 amino acid
protein (Figure 2B). A search of public sequence databases with the predicted
CED-6 sequence indicated that the N-terminal half of CED-6 contains a putative
phosphotyrosine binding (PTB) domain. PTB domains can promote binding to
phosphorylated tyrosine residues located within an appropriate primary sequence
context. The PTB domain is similar in function to, but distinct in structure from, the
SH2 domain. The present inventors have aligned the CED-6 PTB domain with the
PTB domains found in a number of other proteins (Figure 3A). Secondary
structure prediction programs suggest that most of these structural elements also
exist in the CED-6 PTB domain.

In addition to its similarity to known proteins, the CED-6 PTB domain also
showed significant sequence similarity to the predicted translation products of a
number of expressed sequence tags (ESTs; Figure 3A, 3B). In fact, the degree of
similarity between CED-6 and a number of these ESTs was much higher than
between CED-6 and any previously characterized protein (Figure 3A, 3B).
Furthermore, in several cases, the sequence similarity between CED-6 and ESTs
extended beyond the PTB domain (Figure 3B). CED-6 also contains a
proline/serine rich region in its C-terminal half, with 42% serine over a 24 amino
acid stretch and clusters of proline-rich regions (Figure 2B, Figure 3B). These
proline-rich regions are characterized by several PxxP sequence signatures
(Figure 2A), which have been shown to promote interaction with SH3 domains
(Ren et al, (1993); Yu et al (1994) Cell 76 pp 933-945; Grabs et al (1997) J. Biol.
Chem. 272 pp 13419-13425). Between the PTB and proline-rich regions is a short
stretch rich in charged residues (41% charged amino acids over 46 amino acids).
This highly charged region is also found in several other PTB domain containing
proteins, including mouse p96, Shc, and C. elegans M110.5 (Figure 3B).
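
Composition figures of the kind quoted above (42% serine over 24 residues, 41% charged residues over 46 residues) can be obtained with a sliding-window count. The following is a minimal sketch rather than the method actually used by the inventors; the function name is hypothetical, and the definition of "charged" as D, E, K and R (histidine excluded) is an assumption.

```python
CHARGED = frozenset("DEKR")  # assumption: histidine is not counted as charged


def richest_window(seq, residues, window):
    """Return (start, fraction) of the window richest in `residues`.

    `start` is a 0-based index into `seq`; `fraction` is the share of
    positions in that window whose residue belongs to `residues`.
    If several windows tie, the first one is returned.
    """
    seq = seq.upper()
    if window <= 0 or window > len(seq):
        raise ValueError("window must be between 1 and len(seq)")
    count = sum(1 for aa in seq[:window] if aa in residues)
    best_start, best_count = 0, count
    for start in range(1, len(seq) - window + 1):
        # Slide the window by one residue: drop the left one, add the right one.
        count -= seq[start - 1] in residues
        count += seq[start + window - 1] in residues
        if count > best_count:
            best_start, best_count = start, count
    return best_start, best_count / window


if __name__ == "__main__":
    toy = "AAAASSSSAAAA"  # toy sequence, not the CED-6 sequence
    print(richest_window(toy, frozenset("S"), 4))
```

Applied to the protein sequence of Figure 2B with `window=24` and serine, or `window=46` and `CHARGED`, such a scan would locate the serine-rich and charged stretches, provided the same residue definitions are used.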


Conservation of CED-6 Amongst Species

It was found that these EST clones also shared the homology region
beyond the PTB domain with the CED-6 protein. A C. briggsae EST clone has
72% identity to CED-6 over 132 amino acids at the N-terminus, and 64% identity
to CED-6 over 103 amino acids at the C-terminus (Figure 3B). Three overlapping
human EST clones were also obtained and assembled into one sequence. The
human EST fusion sequence showed approximately 54% identity to the PTB domain of CED-6, and
also contains a highly charged region right after the PTB domain. The evolutionary
tree based on the alignment of PTB domains showed that CED-6 formed a
subgroup with EST clones from human, Drosophila, and C. briggsae, suggesting
that these proteins might be functionally conserved. Mouse p96, Drosophila
Disabled, and C. elegans M110.5 formed another subgroup (Figure 3C). The tree
also indicated that the Shc subgroup is more similar than the p96 subgroup to
the CED-6 subgroup.
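
Percent identity values such as those above are computed from a pairwise alignment. The sketch below shows one common convention, counting identical residues over aligned columns in which neither sequence has a gap. Which convention the inventors used is not stated, so both the denominator and the function name are assumptions.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Return (percent identity, columns compared) for two aligned sequences.

    Both strings must come from the same alignment and therefore have the
    same length. Columns containing a gap in either sequence are skipped.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap or b == gap:
            continue  # gapped column: not counted in the denominator
        compared += 1
        identical += a == b
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * identical / compared, compared


if __name__ == "__main__":
    # Toy alignment for illustration only.
    print(percent_identity("ACDE-F", "ACDQGF"))
```

Other conventions divide by the full alignment length or by the length of the shorter sequence, which gives lower values when the alignment contains gaps.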

CED-6 Acts Cell-autonomously Within Engulfing Cells

A genetic mosaic analysis was performed to determine if CED-6 acts
within engulfing cells or dying cells. For convenience, a pair of cell types in the adult
gonad, germ cells and somatic sheath cells (Figure 4A), were used. During
oogenesis a large number of oocytes undergo programmed cell death, and normally
these dying cells are engulfed by somatic sheath cells (Hengartner, 1997). In this
analysis a mosaic pattern of genetic background for CED-6 and wild type between
germ cells and somatic sheath cells was generated. The ncl-1 mutant was used for the
identification of the mosaic pattern at single-cell resolution since in the ncl-1
mutant somatic cells of animals exhibit abnormally enlarged nucleoli, which can be
easily identified under Nomarski optics (Herman, 1984; Genetics 108 pp 165-189;
Hedgecock and Herman, 1995 Genetics 141 pp 989-1006). A strain was
constructed: dpy-17(e164) CED-6(n1813) mec-14(u55) ncl-1(e1865)
unc-36(e251) III; sDp3. This worm strain showed a wild type phenotype since the
sDp3(III;f) duplication covers all these mutations (Rosenbluth et al, (1985)
Genetics 109 pp 493-511). To identify the animals with CED-6 mutant germ cells
and wild-type somatic sheath cells, animals must be found with the duplication
lost from any of the P2, P3 and P4 lineages but not from EMS, MS or any lineage
below MS, which would lead to the loss of the duplication in somatic sheath
cells (Figure 4). These animals can be obtained by looking through many animals
of the constructed strain for animals laying only Dpy Unc progeny. Animals
with loss of the duplication in the P1 lineage also lay only Dpy Unc
progeny; however, these animals are not mosaic animals for the present purpose
since loss of the duplication in the P1 lineage results in the CED-6 mutant
background in both germ cells and somatic sheath cells. From 1,000 dpy-17(e164)
CED-6(n1813) mec-14(u55) ncl-1(e1865) unc-36(e251) III; sDp3 animals, six
animals were identified laying only Dpy Unc progeny. Observation of these six
animals under Nomarski optics indicated that one animal had the duplication lost
in P4, one in P3, three in P2, and one in P1. The five animals with losses in P2, P3
or P4 displayed no cell corpses in the gonad, unlike the one with the duplication
lost in P1, suggesting that

CED-6 is not required in the germline for engulfment. Since the chance of loss of the
duplication in all cell divisions is approximately the same (Hedgecock and
Herman, 1995), the rate of sDp3 loss is 0.15% per cell division. Animals with
CED-6 mutant somatic sheath cells and wild-type germ cells were then sought.
From 500 animals four animals were identified with enlarged nucleoli in the
somatic sheath cells in one arm of the gonad (Figure 5B), and none of the four animals
had lost the duplication in the lineage generating germ cells (Figure 4). Three
animals appeared to have lost the duplication in sheath cells in the anterior arm but
not in the posterior arm, and accumulated cell corpses were observed only
within the anterior gonad arm, but not the posterior gonad arm, of these animals
(Figure 4, Table). One animal had lost the duplication in the sheath cells
surrounding the posterior gonad arm, but not in those surrounding the anterior arm.
This animal had cell corpses accumulated within the posterior arm but not the
anterior arm (Figure 4). These results suggest that CED-6 is required in somatic
sheath cells, the engulfing cells, to eliminate dying cells in the adult gonad.
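
The quoted loss rate of 0.15% per cell division can be reproduced from the counts given above. The sketch below is one plausible reconstruction, not a derivation stated in the text: it assumes that the six Dpy Unc-laying animals found among 1,000 represent losses spread over four scored divisions (those giving rise to P1, P2, P3 and P4), each equally likely to lose the duplication. The function name is hypothetical.

```python
from fractions import Fraction


def loss_rate_per_division(animals_with_loss, animals_screened, divisions):
    """Average probability of losing the duplication at a single division.

    Assumes every scored division is equally likely to lose the duplication
    and that losses are rare enough to be treated as additive.
    """
    if animals_screened <= 0 or divisions <= 0:
        raise ValueError("counts must be positive")
    return Fraction(animals_with_loss, animals_screened) / divisions


if __name__ == "__main__":
    # 6 losses in 1,000 animals over 4 divisions (P1 to P4); the count of
    # four divisions is an assumption made for this illustration.
    rate = loss_rate_per_division(6, 1000, 4)
    print(f"{float(rate) * 100:.2f}% per cell division")
```

With these inputs the result is 6/1000 divided by 4, that is 0.0015 or 0.15% per division, matching the figure in the text.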

CED-6 Promotes the Engulfment of Embryonic and Germ Cell Corpses

To unambiguously demonstrate that F56D2.7 cDNA indeed corresponds to
CED-6, the inventors tested whether the full-length F56D2.7 cDNA can rescue the
engulfment defect of CED-6 mutants. Transgenic animals were generated
carrying the F56D2.7 cDNA under the control of the C. elegans heat shock
promoters hsp-16.2 and hsp-16.48 (see Examples). Used together, these two
promoters drive expression in almost all somatic cells, including both cells that
normally undergo programmed cell death and cells that normally engulf the dying
cells. To test for rescue, embryos laid by transgenic mothers were exposed to a
brief heat shock pulse just prior to the appearance of the first developmental cell
deaths, and the number of persistent corpses visible in the heat-shocked
animals after hatching was scored (Figure 4). As expected, over-expression of F56D2.7
cDNA significantly and specifically reduced the number of persistent cell corpses
visible in CED-6 mutants, confirming that F56D2.7 is the relevant gene affected
by the mutations that were detected in CED-6(n1813) and CED-6(n2095) mutants.

Rescue by F56D2.7 cDNA in the germline was also tested (Figure 5C). Adult
hermaphrodites were exposed to a brief heat shock pulse just prior to the
appearance of germline cell death, and the number of persistent cell
corpses was scored 12 hours and beyond after the heat shock. No cell corpses were found in
the gonads of the majority of animals, suggesting that CED-6 cDNA can also rescue
the engulfment defect of CED-6 in the germline.

Recognition and engulfment of apoptotic cells is a very early event in C.
elegans programmed cell death (Robertson and Thomson, (1982) J. Embryol. Exp.
Morph. 67 pp 89-100). In CED-6 mutants, the extension of cytoplasm is blocked,
resulting in the persistence of cell corpses (Ellis et al, 1991). These cell corpses,
however, eventually disappear from the animal. To determine whether CED-6
acts only in a narrow time-window at the early stage of cell death or whether the
signal transduction pathway can be used to engulf cell corpses formed many hours
after cell death takes place, the inventors tested whether F56D2.7 cDNA promotes
the engulfment of persistent cell corpses. CED-6 was over-expressed three hours
before the embryos hatch, when most of the cells dying by programmed cell death
during embryonic development have been dead for approximately five hours
(Figure 5A), and cell corpses were examined three hours after the heat shock in the
head of L1 larvae. The number of cell corpses was found to be reduced
significantly (Figure 5B). The control experiments with either no heat treatment,
or over-expression of lacZ, showed no obvious effect on the number of corpses,
suggesting that over-expression of CED-6 can promote the engulfment of cell
corpses in the soma (Figure 5B). The inventors also tested if over-expression of
CED-6 could promote the engulfment of cell corpses formed hours after cell
death in the germline (Figure 5D). Adult transgenic animals carrying CED-6
cDNA driven by the heat shock promoters were heat shocked at several time
points after the accumulation of cell corpses in the gonad, and the number of cell
corpses 12 hours after the heat shock was examined. It was found that cell corpses
could be removed sufficiently at all time points, suggesting that over-expression of
CED-6 can promote the engulfment of cell corpses accumulated in the germline for
hours, even days (Figure 5D). The present inventors have concluded that the
signal transduction pathway in which CED-6 is involved can carry on the task of
removing cell corpses, and there is no specific time-window for CED-6 to act
during the process of programmed cell death.

Mosaic CED-6 Protein Expression Supports That CED-6 Acts Within Engulfing
Cells

The invention includes methods to detect quickly if CED-6 acts within
engulfing cells. This method is based on dying cells failing to express proteins so
as to generate a mosaic pattern of protein expression. However, this idea can be
applied only to the soma, and might not apply to the germline, since in the germline all germ
cells share one syncytial cytoplasm (Hirsh et al, (1976) Developmental Biology 49
pp 200-210), so those germ cells carrying the transgenes could contribute the
expressed proteins to the cytoplasm, and subsequently to all newly formed oocytes.
However, the mosaic pattern of protein expression can be generated in the
germline because the transgenes have been found not to be expressed well in germ
cells. The expression pattern of the heat shock promoters in the gonad was examined.
Adult animals carrying lacZ transgenes driven by a heat shock promoter were
heat shocked 24 hours after the L4 molt. LacZ expression in both germ cells
and sheath cells was subsequently examined by beta-gal staining. It was
found that somatic sheath cells were stained blue and that the stain could last 60 hours
after the heat shock, whereas the germline was not stained at any time point after the heat shock. A
similar result was also observed in previous studies (Stringham et al, (1992)
Molecular Biology of the Cell 3 pp 221-233). The expression of CED-6 in the germline
upon heat shock was also examined by scoring three-fold embryos laid by heat-treated
transgenic animals for rescue of the engulfment defect. It was found
that the majority of embryos had the CED-6 mutant phenotype, suggesting that
CED-6 is not expressed well in the germline. That the CED-6 transgene is not
expressed very well in the gonad provided a useful tool to test if CED-6 acts within the somatic
sheath cells. As described in Figures 4 and 5, cell corpses were not observed in the
gonad in the majority of animals at the different time points after the heat treatment,
and the phenomenon lasted until 60 hours or beyond after the heat treatment (Figure 5C).
In contrast to this result, without the heat treatment these transgenic animals had
cell corpses accumulated in the gonad, similar to the CED-6(n1813) mutant.
Over-expression of lacZ did not affect the number of cell corpses in the CED-6
mutant, either (Figure 5C). These results support the conclusion from the mosaic analysis
that CED-6 might act within engulfing cells, the somatic sheath cells. This method
provides a simple way to detect if a gene acts within engulfing cells or dying cells.

Site of Action of CED-6 in Relation to CED-1 and CED-7

To understand if CED-6 genetically interacts with any other engulfment
genes, CED-6 was over-expressed in the genetic backgrounds of CED-1, 7, 2, 5,
and 10. The extra-chromosomal arrays carrying CED-6 cDNA driven by heat
shock promoters were transferred from the CED-6(n1813) background to the wild-type
N2 background, and subsequently to the CED-1, 7, 2, 5, and 10 mutant backgrounds.
CED-6 was then over-expressed by following the method used for the rescue of the
CED-6 engulfment defect by the over-expression of CED-6 cDNA as described in
Figure 5A. It was found that over-expression of CED-6 could partially suppress
the engulfment defect of CED-7(n1997). To understand if the suppression is
allele-specific, two additional alleles, CED-7(n1996) and CED-7(n1892), were
tested and similar results were achieved, suggesting that the suppression is not
allele-specific (Figure 6). For the same purpose three alleles of CED-1, n1506,
n1995, and n1735, were also tested, and it was found that over-expression of CED-6
could partially suppress the engulfment defect of all three alleles of CED-1 (Figure 6).
Several control experiments were performed to confirm that this rescue was
specific for CED-6. Transgenic animals carrying the CED-6 transgene without heat
treatment were tested; over-expression of lacZ in the CED-1 or CED-7 engulfment
mutant background was also tested. The results showed numbers of
cell corpses similar to those of the CED-1 or CED-7 mutants. Heat
treatment reduced the number of cell corpses for CED-7(n1997).
Over-expression of CED-6 reduced the number of cell corpses even more. These
data suggest that the partial suppression of the engulfment defect of both CED-1
and CED-7 is specific for CED-6. It was also observed that over-expression of
CED-6 did not have an obvious effect on the number of cell corpses for CED-2, 5 and
10. These results suggested that CED-6 might act downstream of both CED-1 and
CED-7, and that CED-2, 5 and 10 act either downstream of CED-1, 6, and 7 or in a
different pathway (Figure 6).

The Regulation of CED-6 Expression

SL2 was detected at the 5' end of the CED-6 cDNA, suggesting that CED-6
is a downstream gene of an operon (Huang and Hirsh, (1989) Proc. Natl. Acad.
Sci. USA 86 pp 8640-8644; Spieth et al (1993) Cell 73 pp 521-532; Zorio et al
(1994) Nature 372 pp 270-272; Blumenthal et al (1995) TIG 11 pp 132-136). The
inventors have shown previously that a 10 kb Xho I fragment can rescue the
engulfment defect of the CED-6 mutant. The fragment, however, contains only
CED-6, the downstream gene of an operon, but not the upstream one. The
expression of CED-6 might rely on the 1 kb upstream region of the CED-6 gene, an
intergenic region of the operon. The intergenic region of an operon can sometimes
be used as a promoter for the expression of the downstream gene
(Blumenthal and Steward, (1997) C. elegans II (Cold Spring Harbor: Cold Spring
Harbor Laboratory Press) pp 117-145).

CED-6 is an Adaptor Molecule Acting in the Signal Transduction Pathway of
Engulfment

Protein phosphorylation is a well-defined "switch" mechanism for cells to
deliver signals from one protein to another, and it is essential to transduce
extracellular signals inside cells. The PTB domain is another domain, besides the SH2
domain, able to interact with a phosphorylated tyrosine residue (Kavanaugh
and Williams, (1994) Science 266; Blaikie et al, (1994) J. Biol. Chem. 269 pp 32031-32034).
Several proteins containing PTB domains have been found to act as
adaptor molecules in signal transduction pathways. These include Shc, Sck,
Numb, FE65, Disabled, DOC-2, p96 and IRS-1 (Bork and Margolis, (1995) Cell
80 pp 693-694; Geer and Pawson, (1995) TIBS 20 pp 277-280). Proline rich
regions from many proteins have been shown to form a multiproline helix and
interact with an SH3 domain (Ren et al, 1993; Gout et al, (1993) Cell 75 pp 25-36;
Yu et al, 1994). Both biological analysis and analysis of the crystal structure of
the SH3 binding domain suggested that the sequence signature, PxxP, was
essential for its interaction with the SH3 domain (Ren et al, 1993; Yu et al, 1994;
Grabs et al, 1997). CED-6 contains stretches of proline rich regions containing
the PxxP signature, suggesting its potential to interact with the SH3 domain.
CED-6 is an adaptor molecule that directly or indirectly transduces the signal from
receptors to effectors or cytoskeleton molecules to initiate the engulfment process.

The Interaction Partners of CED-6

The PTB domain has been shown to interact specifically with an NPXY(p)
motif (Kavanaugh and Williams, 1994; Zhou et al, (1995) Nature 378 pp 584-592;
Geer and Pawson, 1995). Many receptors, such as the EGF receptor, TrkA, the insulin
receptor and the IGF-1 receptor, contain this motif at the carboxyl terminus (Geer and
Pawson, 1995). Signals from these receptors have been shown to be transduced
through the interaction of a phosphotyrosine residue of this motif with the PTB
domains of adaptor molecules, such as Shc and insulin receptor substrate 1. The
inventors found that in the intracellular region of CED-7 there was an NPXY(p)
motif. CED-7 has been suggested to act in the same genetic pathway as CED-6
(Ellis et al, 1991). The inventors have shown that CED-7 might act upstream of
CED-6 (Figure 7). CED-7 encodes an ABC transporter, and its mammalian
homologue, ABC1, was found to be required for the macrophage to engulf dying
cells (Luciani and Chimini, 1996), suggesting that CED-7 might act within
engulfing cells. It is possible that CED-6 physically interacts with CED-7,
through binding of its PTB domain to the NPXY(p) motif of CED-7, to regulate the signal
transduction of the engulfment process.
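
Candidate NPXY and PxxP signatures of the kind discussed here can be located in a protein sequence with a simple pattern scan. The sketch below is illustrative only: the function and dictionary names are hypothetical, "X" or "x" is taken to mean any residue, and a sequence match says nothing about whether the tyrosine is phosphorylated or whether the site is actually bound by a PTB or SH3 domain.

```python
import re

# Motif name -> regular expression; "." stands for any single residue.
MOTIFS = {"NPXY": "NP.Y", "PxxP": "P..P"}


def find_motif(seq, pattern):
    """Return the 1-based start positions of all matches of `pattern`.

    A zero-width lookahead is used so that overlapping occurrences
    (common in proline-rich stretches) are all reported.
    """
    regex = re.compile(f"(?=({pattern}))")
    return [m.start() + 1 for m in regex.finditer(seq.upper())]


if __name__ == "__main__":
    # Toy sequences for illustration; not taken from CED-6 or CED-7.
    print(find_motif("GGNPTYGG", MOTIFS["NPXY"]))
    print(find_motif("AAPPLPPKPAA", MOTIFS["PxxP"]))
```

Scanning only the predicted intracellular region of a receptor, rather than the whole protein, would be the natural refinement when looking for PTB docking sites.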

CED-6 also contains a proline/serine rich region with several PxxP sequence
signatures, which might mediate its interaction with the SH3 domain. The
SH3 domain has been suggested to mediate protein-protein interactions between
signaling molecules downstream of membrane-bound receptors (Koch et al, (1991)
Science 252 pp 252-673; Pawson and Schlessinger, (1993) Current Biology 3 pp
434-442). An SH3 domain containing protein is likely to interact with CED-6 and to
regulate the signal transduction pathway of engulfment. Several proteins might
directly or indirectly interact with the CED-6 protein. CED-1 might act upstream of
CED-6 (Figure 6 & 7A). Determining the relationship between CED-1 and CED-6 will depend
on the cloning of the CED-1 gene. A protein with a phosphorylated tyrosine residue
should exist to interact with the PTB domain of CED-6. This phosphorylated
protein is either a tyrosine kinase or a substrate of a tyrosine kinase, and a tyrosine
phosphatase should also be involved in the signal transduction pathway of
engulfment to down-regulate the activity of the phosphorylated proteins. Some
studies on phagocytosis in mammalian systems have shown that a tyrosine kinase
signal transduction pathway might play an essential role in the opsonin-mediated
phagocytosis process (Rosenshine and Finlay, (1993) BioEssays 15 pp 17-24;
Greenberg, (1995) Trends in Cell Biology 5 pp 93-99). The present results suggest
that the same might be the case for PCD-triggered engulfment. These two types
of phagocytosis might share some similarity at the end.

CED-6 Acts Within Engulfing Cells 

A genetic mosaic analysis has been performed to determine that CED-6
acts within engulfing cells. This conclusion was drawn based on the observation
of a pair of cell types, germ cells and somatic sheath cells. We have shown previously
that over-expression of CED-6 can promote the engulfment of cell corpses. Since
cells that have been dead for many hours are very unlikely to maintain their ability
for protein expression (Estus, 1994; Freeman, 1994), the rescue of cell corpses is
most likely to be due to the expression of CED-6 within the engulfing cells. This
result suggests that CED-6 also acts within the engulfing cells in the soma.
Previously it has been shown by the inventors that over-expression of CED-6
could rescue the engulfment defect of CED-6 in both soma and germline (Figure
5), suggesting that CED-6 acts by a similar mechanism in both places.



CED-6 Can Promote the Engulfment of Cell Corpses

Over-expression of CED-6 promotes the engulfment both of dying cells at a
very early stage of cell death and of cell corpses formed hours after cell
death. Cell corpses have been shown to have a typical morphology of apoptotic
cells, for instance, membrane blebbing. The antigens presented on the membrane
surface of cell corpses for their recognition by engulfing cells might be somewhat
different from those on the membrane surface of early dying cells. Irrespective
of whether the ligands on dying cells and the receptors on engulfing cells are the same in
both situations, CED-6 is required for the engulfment. In some animals a few cell corpses in the
gonad had still not been removed at later times after the heat
shock. These corpses tend to be located in between oocytes and close to the
spermatheca. The failure of the engulfment of these cell corpses might be due to
their lack of contact with the sheath cells. It is concluded that cell corpses, just
like dying cells at the early stage of PCD, can trigger phagocytosis. In mec-4
mutant animals six touch sensory neurons die a necrotic death due to a channel
defect leading to impaired osmotic pressure in these cells (Driscoll and Chalfie,
(1991) Nature 349 pp 588-593). Chung and Driscoll showed that the removal of
the swelling dead cells was delayed significantly in the CED-6 background,
implying that CED-6 is also involved in the removal of necrotic dying cells. Thus,
there might be similar signals presented on the surface of dead cells to allow them
to be recognized by engulfing cells regardless of the manner of death; and the
signal transduction pathway in which CED-6 is involved can be used to respond to
these signals to cause engulfment. The fact that engulfment is triggered so early
and is completed so swiftly is a clever design of nature; it is important especially
for tissues with massive cell death.

Conservation of the Engulfment Program 

In an alignment, an EST clone from C. briggsae is highly conserved with
CED-6 in both the N- and C-terminal regions, suggesting that this EST clone might
represent a real CED-6 homologue (Figure 3B). EST clones from Drosophila and
human are also highly conserved with CED-6 but mainly in the region of the PTB
domain (Figure 3A & 3B). This result suggested the possibility that these PTB
domain proteins are functional homologues of CED-6 in those species. As a
result two human homologues of the C. elegans CED-6 gene have been cloned and
characterized.

Expression Vectors and Transfected Mammalian Cells Expressing CED-6 

Fragments of C. elegans CED-6 DNA were inserted into commercially
available vectors, including vectors having the reporter gene green fluorescent
protein (GFP); the constructs are set out in Table 1 below:

TABLE 1

GFP-CED-6 expression in MCF7
Cloning of CED-6 fragments in pEGFP

Fragments, from ... (bp) to (bp):
2-1591; 22-1492; 598-1581; 598-1494; 22-745; 744-1581; 744-1494

Vector         Constructs
TA-PCR         pGA1, pGA2, pGA3, pGA4, pGA5, pGA6, pGA7
pAS2           pGA1011, pGAlOU
pGAD414
pEGFP-C1(*)    pGA3011, pGA3013, pGA3015
pEGFP-C3(*)    pGA3036
pEGFP-N3(*)    pGA3045
pEGFP-N3(*)    pGA3062, pGA3064, pGA3067

(*) commercially available from Clontech



Visualization of GFP fluorescence in MCF7 cells

Human breast cancer cells, MCF7 (ATCC: HTB-22), were seeded in Lab
Tek chambered coverglass (Nalge Nunc International) and transfected using
lipofectAMINE (GibcoBRL). After 18 hours, the chambered coverglasses were
placed on an inverted microscope, and GFP fluorescence could be visualized.



Expression of GFP-CED-6

Subcellular localization of worm CED-6 was assayed using GFP fusion
proteins. By using different fragments the inventors showed that CED-6 has a clear
cytoplasmic localization. This localization was abolished when only the PTB domain of
CED-6 was used, indicating that the C-terminal part might be implicated in proper
targeting. Since the actual expression level varies from cell to cell one can observe
an apoptotic phenotype in highly expressing cells and an elevated level of
phagocytosis in strongly expressing cells. In addition, localization to the lamelli was
observed in some cells which perform engulfment.

The MCF7 cells transfected as above are useful for conducting assays to
identify compounds which inhibit or enhance CED-6, as will be
discussed hereafter.

Human Homologues of C. elegans CED-6

In accordance with the invention there is provided an isolated protein which
is an adaptor molecule in a signal transduction pathway which regulates
phagocytosis of apoptotic cells.

In accordance with another embodiment of the invention there is provided an
isolated protein which is a human homologue of C. elegans CED-6 and which
comprises an amino acid sequence as shown in Figure 20 or Figure 22 (SEQ ID No: 8)
or an amino acid sequence which differs from that shown in Figure 20 only in
conservative amino acid changes (h1CED-6).

Also provided is a nucleic acid (DNA, RNA, cDNA or genomic DNA; SEQ
ID Nos: 7, 13, 15) encoding h1CED-6, h2CED-6 or h3CED-6 (SEQ ID Nos: 8, 14,
16) or a functional equivalent thereof. For example, the invention encompasses a
nucleic acid comprising the sequence of nucleotides from about nucleotide position
430 to about nucleotide position 1344 shown in Figure 18, Figure 19, or Figure 22,
or the entire sequence of nucleotides shown in these figures. The invention includes
the open reading frame of the nucleic acid sequence that encodes C. elegans CED-6,
h1CED-6, h2CED-6 or h3CED-6.

The invention also provides a protein which is a fragment of the protein with
the amino acid sequence shown in Figure 20, Figure 22 or Figure 29 (SEQ ID Nos: 8,
14, 16). The fragment may comprise a sequence of amino acids corresponding to
the phosphotyrosine binding (PTB) domain of SEQ ID Nos: 8, 14, 16. For example,
the PTB domain of SEQ ID Nos: 8 or 16 is from amino acid Nos. 15-157. The
invention also pertains to the nucleic acid and amino acid sequences of the
proline/serine rich domains of h1CED-6 and/or h3CED-6 (e.g., amino acid Nos.
201-276 in Figures 20, 22, or 29). Similarly, the highly charged region of SEQ ID
Nos: 8 or 16 is encompassed by the invention (e.g., amino acid Nos. 161-195 of
Figures 20, 22 and 29). The invention includes the nucleic acid sequences that
encode these fragments.

There is also identified herein a splice variant of h3CED-6 (referred to herein
as h2CED-6), which variant comprises an amino acid sequence as shown in Figure
21 (SEQ ID No: 14) or an amino acid sequence which differs from that shown in
Figure 21 only in conservative amino acid changes. Also provided is a nucleic acid
(DNA, RNA, cDNA or genomic DNA) encoding h2CED-6 (SEQ ID No: 13) or a
functional equivalent thereof, for example a nucleic acid comprising from about
nucleotide position 430 to about nucleotide position 1206 in Figure 19 or the entire
nucleotide sequence shown in Figure 19 (SEQ ID No: 13).
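The nucleotide ranges quoted above (about 430 to about 1344, and about 430 to about 1206 for h2CED-6) can be checked for consistency with a whole number of codons. The Python sketch below is a hedged illustration: it assumes 1-based inclusive positions, treats the "about" positions as exact, and uses a toy sequence rather than the sequences of the figures; the function names are hypothetical.

```python
def coding_region(sequence: str, start: int, end: int) -> str:
    """Return sequence[start..end] using 1-based inclusive positions (assumed)."""
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError("positions fall outside the sequence")
    return sequence[start - 1:end]


def codon_count(start: int, end: int) -> int:
    """Number of codons spanned by an inclusive range; rejects partial codons."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"range of {length} nt is not a whole number of codons")
    return length // 3


if __name__ == "__main__":
    # Ranges quoted in the text, taken here as exact positions.
    print("430-1344:", codon_count(430, 1344), "codons")
    print("430-1206:", codon_count(430, 1206), "codons")
    # Toy sequence for illustration only, not a CED-6 sequence.
    print(coding_region("AAATGGCCTAA", 3, 11))
```

Both quoted ranges divide evenly into codons under these assumptions. Whether the final codon in each range is a stop codon is not stated in the text and is not assumed here.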

The human CED-6 amino acid sequence (SEQ ID NO: 16) is also shown in
Figure 26. Amino acid sequence SEQ ID NO: 16 (human CED-6) and SEQ ID NO:
8 (h1CED-6) differ at amino acid No. 150. The nucleic acid sequence (SEQ ID NO:
15) that encodes human CED-6 is shown in Figure 30A-B. The claimed invention
includes SEQ ID NOs: 15 and/or 16, the open reading frame of SEQ ID NO: 15,
and the nucleic acid and amino acid sequences that encode the functional fragments
(e.g., the serine/proline rich region, the PTB domain or the highly charged domain),
as described herein.

The invention also provides a fusion protein in which one part of the fusion
is a protein having an amino acid sequence as shown in any of SEQ ID Nos: 8, 14 or
16 or a sequence differing from those sequences only in conservative amino acid
changes. The protein may be fused with, for example, an epitope tag or the
expression product of a reporter gene.

The present invention is intended to encompass CED-6 proteins (e.g., C.
elegans CED-6, h1CED-6, h2CED-6 and/or h3CED-6) and polypeptides having
amino acid sequences analogous to the amino acid sequences of CED-6. Such
polypeptides are defined herein as CED-6 analogs (e.g., homologues), orthologs, or
mutants or derivatives. Analogous amino acid sequences are defined herein to mean
amino acid sequences with sufficient identity to the CED-6 (e.g., C. elegans CED-6,
h1CED-6, h2CED-6 or h3CED-6) amino acid sequence to possess the biological
activity of CED-6. For example, an analog polypeptide can be produced with
"silent" changes in the amino acid sequence wherein one, or more, amino acid
residues differ from the amino acid residues of CED-6, yet which still possesses
the biological activity of CED-6. Examples of such differences include additions,
deletions or substitutions of residues of the amino acid sequence of CED-6. Also
encompassed by the present invention are analogous polypeptides that exhibit
greater, or lesser, biological activity than the CED-6 proteins of the present
invention.

The claimed CED-6 protein and nucleic acid sequences include homologues,
as defined herein. The homologous proteins and nucleic acid sequences can be
determined using methods known to those of skill in the art. Initial homology
searches can be performed at NCBI against the GenBank (release 87.0), EMBL
(release 39.0), dbEST, and SwissProt (release 30.0) databases, and other EST
databases, using the BLAST network service (Altschul, S.F., et al., Basic Local
Alignment Search Tool, J. Mol. Biol. 215: 403 (1990), the teachings of which are
incorporated herein by reference). Computer analysis of nucleotide sequences can
be performed using the MOTIFS and the FindPatterns subroutines of the Genetics
Computer Group (GCG, version 8.0) software. Protein and/or nucleotide
comparisons can also be performed according to Higgins and Sharp (Higgins, D.G.
and P.M. Sharp, "Description of the method used in CLUSTAL," Gene 73: 237-244
(1988)). Homologous proteins and/or nucleic acid sequences to the CED-6 protein
and/or nucleic acid sequences that encode the CED-6 protein are defined as those
molecules with greater than 70% sequence identity and/or similarity (e.g., 75%,
80%, 85%, 90%, or 95% homology).

The "biological activity" of CED-6 proteins is defined herein to mean the
ability to regulate or affect the phagocytosis of apoptotic cells.
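The "greater than 70% sequence identity" criterion for homologues can be illustrated with a simple calculation over two sequences that have already been aligned (for example by BLAST or CLUSTAL). The Python sketch below is an assumption-laden illustration: it counts identity over ungapped alignment columns only, which is one of several conventions in use, and the sequences and function names are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over columns where neither sequence has a gap.

    Assumption: the inputs are pre-aligned and of equal length. Other tools
    may divide by alignment length or by the shorter sequence instead.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == gap or y == gap:
            continue  # skip gapped columns
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


def is_homologue(aligned_a: str, aligned_b: str, threshold: float = 70.0) -> bool:
    """Apply the 'greater than 70% identity' definition from the text."""
    return percent_identity(aligned_a, aligned_b) > threshold


if __name__ == "__main__":
    # Toy peptides for illustration only, not CED-6 sequences.
    print(percent_identity("MKTAYIAKQR", "MKTAYLAKQR"))
    print(is_homologue("MKTAYIAKQR", "MKTAYLAKQR"))
```

The threshold is exposed as a parameter so that the stricter example values given in the text (75%, 80%, 85%, 90%, 95%) can be applied in the same way.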

The claimed CED-6 proteins also encompass biologically active
polypeptide fragments of the CED-6 proteins described herein. Such fragments can
include only a part of the full-length amino acid sequence of a CED-6 protein yet
possess the ability to modulate or regulate phagocytosis of apoptotic cells. For
example, polypeptide fragments comprising deletion mutants of the CED-6 proteins
can be designed and expressed by well-known laboratory methods. Such polypeptide
fragments can be evaluated for biological activity, as described herein.

Antibodies can be raised to the CED-6 proteins and analogs, using
techniques known to those of skill in the art. These antibodies, whether polyclonal,
monoclonal, chimeric, or fragments thereof, can be used to immunoaffinity purify or
identify CED-6 proteins contained in a mixture of proteins, using techniques well
known to those of skill in the art. These antibodies, or antibody fragments, can also
be used to detect the presence of CED-6 proteins and homologs in other tissues
using standard immunochemistry methods.

In particular, biologically active derivatives or analogs of the
above-described proteins, including fragments and functional domains from C. elegans
CED-6, h1CED-6, h2CED-6, or h3CED-6, referred to herein as peptide mimetics,
can be designed and produced by techniques known to those of skill in the art (see,
e.g., U.S. Patent Nos. 4,612,132; 5,643,873 and 5,654,276, the teachings of which
are incorporated herein by reference). These mimetics can be based, for example, on
a specific CED-6, h1CED-6, h2CED-6 or h3CED-6 amino acid sequence and
maintain the relative position in space of the corresponding amino acid sequence.
These peptide mimetics possess biological activity similar to the biological activity
of the corresponding peptide compound, but possess a "biological advantage" over
the corresponding CED-6 amino acid sequence with respect to one, or more, of the
following properties: solubility, stability and susceptibility to hydrolysis and
proteolysis.

Methods for preparing peptide mimetics include modifying the N-terminal
amino group, the C-terminal carboxyl group, and/or changing one or more of the
amino linkages in the peptide to a non-amino linkage. Two or more such
modifications can be coupled in one peptide mimetic molecule. Modifications of
peptides to produce peptide mimetics are described in U.S. Patent Nos. 5,643,873
and 5,654,276, the teachings of which are incorporated herein by reference. Other
forms of the h1, h2, or h3 CED-6 proteins encompassed by the claimed invention
include those which are "functionally equivalent." This term, as used herein, refers
to any nucleic acid sequence and its encoded amino acid sequence which mimic the
biological activity of the h1, h2, or h3 CED-6 proteins and/or functional domains
thereof. "Biologically active" is used to describe a protein capable of regulating the
phagocytosis of apoptotic cells.

A polypeptide can be in the form of a conjugate or a fusion protein, both of
which can be made by known methods. Fusion proteins can be manufactured
according to known methods of recombinant DNA technology. For example, fusion
proteins can be expressed from a nucleic acid molecule comprising sequences which
code for a biologically active portion of the protein and its fusion partner, for
example a portion of an immunoglobulin molecule. For example, some
embodiments can be produced by the insertion of a nucleic acid encoding
immunoglobulin sequences into a suitable expression vector, phage vector, or other
commercially available vector. The resulting construct can be introduced into a
suitable host cell for expression. Upon expression, the fusion proteins can be
isolated or purified from a cell by means of an affinity matrix.

Expression vectors incorporating any of the above mentioned nucleic acids 
including those designated SEQ ID Nos: 1, 3, 5, 7, 9, 11, 13 or 15, optionally with a 
reporter gene as aforesaid, are also provided by the invention. 

The present invention also encompasses isolated nucleic acid sequences
encoding the CED-6 (e.g., C. elegans CED-6, h1CED-6, h2CED-6 or h3CED-6)
proteins described herein, and fragments of nucleic acid sequences encoding
biologically active CED-6 proteins. Fragments of the nucleic acid sequences
described herein are useful as probes. Specifically provided for in the present
invention are DNA/RNA sequences encoding CED-6 proteins, the fully
complementary strands of these sequences, and allelic variations thereof. Also
encompassed by the present invention are nucleic acid sequences, genomic DNA,
cDNA, RNA or a combination thereof, which are substantially complementary to
the DNA sequences encoding CED-6, and which specifically hybridize with the
CED-6 DNA sequences under conditions of stringency known to those of skill in the
art, those conditions being sufficient to identify DNA sequences with substantial
nucleic acid identity. As defined herein, substantially complementary means that the
sequence need not reflect the exact sequence of the CED-6 (e.g., C. elegans CED-6,
h1CED-6, h2CED-6 or h3CED-6) DNA, but must be sufficiently similar in identity
of sequence to hybridize with CED-6 DNA under stringent conditions. Conditions
of stringency are described in, e.g., Ausubel, F.M., et al., Current Protocols in
Molecular Biology, (Current Protocols, 1994). For example, non-complementary
bases can be interspersed in the sequence, or the sequences can be longer or shorter
than CED-6 DNA, provided that the sequence has a sufficient number of bases
complementary to CED-6 to hybridize therewith. Exemplary hybridization
conditions are described herein.

Cloning of human CED-6 

Following the cloning of the C. elegans CED-6 gene and the full sequencing
of the open reading frame, extensive searches against public domain human
databases were performed. These revealed statistically significant homologies to a
number of ESTs at the carboxy terminal region of the protein, and one EST showed
homology to the carboxy terminal of the PTB domain and to the beginning of the
charged region. These ESTs were used for construction of primers for 5'RACE
using a Marathon-ready cDNA colorectal adenocarcinoma library from Clontech.
Subsequent additional sequence analysis and rounds of database searching revealed
additional ESTs which enabled construction of a consensus sequence of
approximately 2400 bp for h3CED-6 (Figure 6). Further sequence analysis has
revealed a splice variant of the sequence shown in Figure 18 (h2CED-6), the portion
which is alternatively spliced being underlined. The DNA of h2CED-6 is shown in
Figure 19 and the amino acid sequence in Figure 21. The amino acid sequence of
h2CED-6 is consistent with it being a dominant negative version of h1 or h3 CED-6
which antagonizes the activity of h1 or h3CED-6.



Assays for the identification of inhibitors and enhancers of CED-6,
h1CED-6, h2CED-6, or h3CED-6

The cloning and functional characterization of C. elegans CED-6 and its two
human homologues have permitted assay methods to be developed which allow
identification of compounds which might inhibit or enhance CED-6, h1CED-6,
h2CED-6, or h3CED-6 activity or inhibit or enhance the transcription of these
proteins. These may involve detection of the level of phagocytosis of apoptotic
particles, measurement of the level of actin-cytoskeleton rearrangement or detection
of the level of transcription of the CED-6 proteins via a reporter gene such as GFP.

An assay for the identification of inhibitors and/or enhancers of phagocytosis
may consist of a cell line stably or transiently transfected with CED-6, h1CED-6,
h2CED-6, or h3CED-6 or any other member of the CED-6 signal transduction
pathway. Cell lines may also be microinjected with purified protein or vectors
expressing antisense RNA. The expression product may be a fusion protein with
GFP. Non-transfected cells can also be used in the assay. The cell line may be a
fibroblast cell line such as COS1, BHK 21, L929, CV1, Swiss 3T3, HT144, IMR32
or another fibroblast cell line. The cell line may also be an epithelial cell line such as
HEPG2, MDCK, MCF7, 293, HeLa, A549, SW48, G361, or any other epithelial cell
line. The cell line may be a primary line, such as human dermal FIBs, dermal
keratinocytes, leucocytes, monocytes, macrophages, or any other primary cell line.
Cells may be double transfected with other genes (such as lectin, CD14, SRA, CD36,
ABC1, CED5, DOCK180) of vertebrate (human, fish, mouse) or
invertebrate (C. elegans) origin.

Phagocytosis assays consist of the addition of particles and/or apoptotic
cells to these cell lines and their uptake by them. The particle may be opsonized,
heat- or chemically-killed bacteria and yeast in a variety of sizes, shapes and
natural antigenicities. The particle or cell may be an opsonized, fluorescently
labeled, heat- or chemically-killed bacterium or yeast in a variety of sizes, shapes
and natural antigenicities. The cell may be an apoptotic neutrophil, apoptotic
lymphocyte, apoptotic erythrocyte or any other apoptotic cell. These apoptotic cells
may be opsonized and/or labeled with dyes or fluorescent dyes. The killed bacteria
or yeast cells and the apoptotic cells are referred to herein as apoptotic particles.



Assay 1 

Cells transfected with CED-6 or any other gene described herein, for
example, nucleic acids of SEQ ID Nos: 1, 3, 7, 9, 11, 13, or 15, can be grown in
monolayer or in suspension. The apoptotic particles are added to the transfected
cells. Phagocytosis can be followed by the uptake rate of the apoptotic particles.
This can be measured by microscopy, by fluorescence microscopy, by quantitative
spectrofluorometry and by flow cytometry. Cells and/or particles may additionally
be labeled with dyes, fluorescent dyes, antibodies, and dyes or fluorescent dyes
linked to antibodies prior to detection and measurement. A decrease or increase in
the uptake of the apoptotic particles is a measure of the influence of the
transfected gene or genes on phagocytosis.



Assay 2 

Compounds can be added to Assay 1 to test their influence on the genes that
are involved in the phagocytosis pathway. Transiently or stably transfected cells are
grown in suspension or in monolayer. A series of compounds is added to the cells
prior to the addition of the apoptotic particles. The influence of the compounds can
be measured by comparing the uptake rate of the apoptotic particles with and
without the addition of the compound. Measurements are as described in Assay 1.
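The comparison of uptake with and without a compound, as described for Assay 2, can be sketched as a ratio of mean uptake fractions. The Python example below is purely illustrative: the replicate values, the 10% tolerance band and the function names are hypothetical and are not taken from the specification.

```python
from statistics import mean


def uptake_fraction(cells_with_particles: int, cells_counted: int) -> float:
    """Fraction of counted cells that have taken up at least one particle."""
    if cells_counted <= 0 or not 0 <= cells_with_particles <= cells_counted:
        raise ValueError("invalid cell counts")
    return cells_with_particles / cells_counted


def compound_effect(treated, untreated, tolerance: float = 0.10):
    """Compare mean uptake with and without compound.

    Returns (ratio, label). The tolerance band around 1.0 is an arbitrary,
    hypothetical cut-off; a real screen would use a statistical test.
    """
    baseline = mean(untreated)
    if baseline <= 0:
        raise ValueError("untreated uptake must be positive")
    ratio = mean(treated) / baseline
    if ratio > 1 + tolerance:
        label = "enhancer"
    elif ratio < 1 - tolerance:
        label = "inhibitor"
    else:
        label = "no effect"
    return ratio, label


if __name__ == "__main__":
    # Hypothetical replicate wells: uptake fractions with and without compound.
    untreated = [uptake_fraction(30, 100), uptake_fraction(32, 100)]
    treated = [uptake_fraction(60, 100), uptake_fraction(58, 100)]
    print(compound_effect(treated, untreated))
```

The same ratio could be computed from fluorescence intensities or flow cytometry counts in place of microscopy counts, since all that is compared is uptake with versus without the compound.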

Assay 3

Cells are able to phagocytose apoptotic particles by engulfment of particles.
This involves the reorganization of the actin cytoskeleton. Mammalian cells may be
transiently or stably transfected with CED-6 or any gene involved in the CED-6
phagocytosis signal transduction pathway, for example, with a nucleic acid having
the sequence of nucleotides shown in any one of SEQ ID Nos: 1, 3, 5, 7, 9, 11, 13 or
15. Cells can be any cell as described in Assay 1. The genes may be expressed as a
GFP fusion product. Cells may be double transfected (see Assay 1). The
reorganization of the actin cytoskeleton can be visualized with fluorescent dyes
linked to phalloidin, which interacts with F-actin. Reorganization of the
cytoskeleton is a measure of the engulfment induction by the transfected gene or
genes. Transfected cells may be treated with particles or apoptotic cells as described
in Assay 1. Reorganization of the cytoskeleton is visualized by microscopy or
fluorescence microscopy.

Assay 4 

Compounds can be added to Assay 3 to test their influence on the genes that
are involved in the cytoskeleton reorganization related to the phagocytosis pathway
and engulfment. These compounds may enhance or inhibit the engulfment or
cytoskeleton reorganization induced by the introduced genes. Transiently or stably
transfected cells are grown in suspension or in monolayer. A series of compounds is
added to the cells. The influence of the compounds can be measured by comparing
the reorganization of the actin cytoskeleton with and without the addition of the
compound. Measurements are as described in Assay 1, Assay 2 and Assay 3.
Apoptotic particles may be added in this test to induce phagocytosis, as described in
Assay 2.

Assay 5 

Non-transfected or transfected cell lines such as those described above may
be microinjected with purified CED-6 protein, for example, a protein having the
amino acid sequence as shown in SEQ ID Nos: 2, 4, 6, 8, 10, 12, 14, or 16, or any
protein from the CED-6 pathway, or a fusion protein comprising any of said proteins.
Microinjection can be done on the primary cell lines, the fibroblast cell lines or the
epithelial cell lines. The cell lines can be transfected with another gene prior
to microinjection. Assays 1 through 4 can be performed on these
microinjected cells.

Assay 6

Transfected or non-transfected cell lines as described above may be
microinjected with a vector expressing CED-6 antisense RNA, including antisense
RNA in respect of any of the aforementioned proteins, or any antisense RNA for
genes involved in the CED-6 pathway. Microinjection can be done on the primary
cell lines, the fibroblast cell lines or the epithelial cell lines. The cell lines can be
transfected with another gene prior to microinjection. Assays 1 through 5 can
be performed on these microinjected cells.

Assay 7 

Cell lines as described in Assay 6 may be microinjected with a vector
expressing CED-6 antisense RNA or any antisense RNA for genes involved in the
CED-6 pathway. Microinjection can be done on the macrophages. Inhibitory effects
of the antisense RNA, through inhibition of the CED-6 gene or genes involved in the
CED-6 pathway, can be followed and detected as described in Assays 1 through 6.
Compounds can be isolated which rescue the negative phenotype.

Phagocytosis assays to screen for CED-6 inhibitors/enhancers in C. elegans

The C. elegans CED-6 gene promotes the engulfment of dying embryonic
and germ cells and persistent cell corpses. C. elegans may be used for detection and
isolation of compounds that have an enhancing or inhibitory influence on
phagocytosis and engulfment. In particular, mutant worms lacking CED-6 activity or
with otherwise altered CED-6 activity may be used, or alternatively a transgenic
worm transfected or transferred with CED-6, h1CED-6, h2CED-6, or h3CED-6
DNA may be used.

Assay 8 

A series of compounds may be applied to CED-6 mutant worms or to
worms harboring mutations in the CED-6 pathway. Restoration of engulfment
induced by the compounds can be visualized using Nomarski microscopy by
counting cell corpses remaining in the head region of L1 larvae and in the gonads of
the worms.
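Corpse counts of the kind described for Assay 8 can be summarized as a single rescue score. The Python sketch below is an illustration under stated assumptions: the normalization formula (0 for no change relative to untreated mutants, 1 for a return to wild-type corpse counts), the sample counts and the function name are all hypothetical and are not part of the described assay.

```python
from statistics import mean


def rescue_fraction(treated, mutant, wild_type) -> float:
    """Fraction of the engulfment defect reversed by a compound.

    Inputs are per-animal counts of persistent cell corpses (e.g., in the
    head region of L1 larvae). Assumed normalization:
        (mean mutant - mean treated) / (mean mutant - mean wild type)
    """
    m, t, w = mean(mutant), mean(treated), mean(wild_type)
    if m <= w:
        raise ValueError("mutant must show more corpses than wild type")
    return (m - t) / (m - w)


if __name__ == "__main__":
    # Hypothetical counts for illustration only.
    wild_type = [0, 1, 0, 0]
    mutant = [20, 22, 19, 21]
    treated = [10, 12, 11, 9]
    print(f"rescue fraction: {rescue_fraction(treated, mutant, wild_type):.2f}")
```

A value near 1 would suggest that engulfment is largely restored, while a negative value would indicate that more corpses persist than in untreated mutants.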

Assay 9 

A series of compounds may be applied to humanized CED-6 mutant worms.
Humanized worms are worms which express the human CED-6 gene and are mutated
for the C. elegans gene. Human CED-6 rescues the mutant phenotype. Compounds
inhibiting or enhancing the CED-6 phenotype can be selected by visualization of the
engulfment phenotype using Nomarski microscopy and looking for cell corpses as
aforesaid.

Medical applications

The process of apoptosis has been implicated in the etiology of, or associated
with the pathology of, a wide range of diseases, including cancer, autoimmune
diseases, various neurodegenerative diseases such as Amyotrophic Lateral Sclerosis,
Huntington's Disease, and Alzheimer's Disease, stroke, myocardial infarction,
and AIDS (Thompson, 1995). Thus a better understanding of the molecular events
that underlie apoptosis might lead to novel therapeutic interventions. While much of
the current attention is centered on the genes and proteins that control the killing
step of the death process, it is very likely that the removal of apoptotic cells will
also prove to be crucial for the proper overall functioning of the apoptotic program,
and will offer another entry point for therapeutic intervention.

The process of recognition and engulfment of dying cells is extremely swift
and efficient. In animals, it is essentially impossible to find a cell with apoptotic
features that is not already within another cell. Such rapid recognition and
phagocytosis of apoptotic cells is a crucial aspect of programmed cell death in vivo:
unengulfed apoptotic bodies can undergo secondary necrosis, leading to
inflammation. Failure to remove apoptotic bodies also exposes the body to novel
epitopes (from, e.g., caspase-generated protein fragments), possibly encouraging the
development of autoimmune disease. Persistent apoptotic bodies can often be
observed following chemotherapeutic intervention (which leads to extensive
apoptosis) and are particularly abundant in solid tumors, in which clearance of cell
corpses might be delayed.

It is likely that failure to properly dispose of apoptotic cells leads to human
disease. Genes involved in phagocytosis could therefore correspond to currently
uncloned human inherited disease genes. Restoring proper phagocytosis would be a
valid therapy for certain types of inflammation and autoimmune diseases.
Conversely, in some cases, cells that should be maintained are inappropriately
recognized by the engulfment machinery and cleared from the body. Preventing the
engulfment of such cells could be of great therapeutic value. Examples of such
diseases might include neurodegenerative diseases and stroke, as well as sickle cell
anaemia.

In addition, activation of engulfment could be used for the same cases for
which it is proposed to use activation of apoptosis, e.g., cancer. Indeed, specific
activation within the cancer cells of the pro-engulfing signal would lead to the cells'
removal (and death) without needing to activate the rest of the apoptotic
machinery. This could be particularly useful for highly resistant tumors in which
crucial elements of the central apoptotic machinery have already been inactivated.

Thus, in accordance with another of its aspects, the invention provides a
method of treating, for example, inflammation, autoimmune disease and cancer by
administering to a patient an effective amount of a substance which enhances
phagocytosis of apoptotic cells, in particular a substance which enhances the activity
of h1CED-6, h3CED-6 or the signal transduction pathway in which it participates.
Such substances include h1CED-6 or h3CED-6 itself, a nucleic acid encoding
h1CED-6 or h3CED-6, an anti-sense nucleic acid to h1, h2 or h3 CED-6, or compounds
identified in any of the aforementioned assays as enhancers of CED-6, h1CED-6,
h2CED-6, or h3CED-6 or of transcription thereof.

In addition, the invention also enables a method of treatment of, for example,
neurodegenerative diseases, stroke and sickle-cell anaemia by administering to a
patient an effective amount of a substance which inhibits phagocytosis of apoptotic
cells, in particular a substance which inhibits the activity of h1CED-6 or h3CED-6
or the signal transduction pathway in which it participates. Such substances include
h2CED-6, a nucleic acid encoding h2CED-6, an anti-sense nucleic acid to h1CED-6
or h3CED-6, or compounds identified in any of the aforementioned assays as
inhibitors of CED-6, h1CED-6 or h3CED-6 or of transcription thereof.

Pharmaceutical compositions comprising any of the above-mentioned
therapeutic substances and a pharmaceutically acceptable carrier are also envisaged
by the invention.

To accomplish the various therapeutic treatments as described herein, a
nucleic acid which encodes h1, h2 or h3 CED-6 or a functional portion or domain
thereof must be introduced into a mammalian cell (e.g., a mammalian somatic cell or
a mammalian germ line cell (sperm and egg cells)). This can be accomplished by
inserting the isolated nucleic acid that encodes either the full length protein, or the
domains described herein, or a functional equivalent thereof, into a nucleic acid
vector, e.g., a DNA vector such as a plasmid, virus or other suitable replicon (e.g., a
viral vector), which can be present in a single copy or multiple copies. The nucleic
acid may be transfected or transformed into cells using suitable methods known in
the art such as electroporation, microinjection, infection, lipofection and direct
uptake. Such methods are described in more detail, for example, in Sambrook et al.,
"Molecular Cloning: A Laboratory Manual," 2nd Ed. (1989) and Ausubel, F.M., et al.,
Current Protocols in Molecular Biology, (Current Protocols, 1994).

h1, h2 or h3 CED-6 can be delivered to a cell by the use of viral vectors
comprising one or more nucleic acid sequences encoding those proteins. Generally,
the nucleic acid sequence has been incorporated into the genome of the viral vector.
In vitro, the viral vector containing the h1, h2 or h3 CED-6 protein described herein
or nucleic acid sequences encoding the protein can be contacted with a cell and
infection can occur. The cell can then be used experimentally to study
phagocytosis of apoptotic cells or for assays as aforesaid, or be implanted into a
patient for therapeutic use. The cell can be migratory, such as hematopoietic cells,
or non-migratory, such as a solid tumor or fibroblast. The cell can be present in a
biological sample obtained from the patient (e.g., blood, bone marrow) and used in
the treatment of disease, or can be obtained from cell culture.

After contact with the viral vector comprising the h1, h2 or h3 CED-6
protein or a nucleic acid sequence encoding them, the sample can be returned or
readministered to a cell culture or patient according to methods known to those
practiced in the art. In the case of delivery to a patient or experimental animal
model (e.g., rat, mouse, monkey, chimpanzee), such a treatment procedure is
sometimes referred to as ex vivo treatment or therapy. Frequently, the cell is
targeted from the patient or animal and returned to the patient or animal once
contacted with the viral vector comprising the activated mutant of the present
invention. Ex vivo gene therapy has been described, for example, in Kasid, et al.,
Proc. Natl. Acad. Sci. USA 87:473 (1990); Rosenberg, et al., New Engl. J. Med.
323:570 (1990); Williams, et al., Nature 310:476 (1984); Dick, et al., Cell 42:71
(1985); Keller, et al., Nature 318:149 (1985) and Anderson, et al., U.S. Patent No.
5,399,346 (1994).

Where a cell is contacted in vitro, the cell incorporating the viral vector
comprising a nucleic acid sequence of h1CED-6, h2CED-6 or h3CED-6 can be
implanted into a patient or experimental animal model for delivery, or used in in
vitro experimentation to study cellular events mediated by h1, h2 or h3 CED-6.

Various viral vectors can be used to introduce the nucleic acid into a
mammalian cell. Viral vectors include retrovirus, adenovirus, parvovirus (e.g.,
adeno-associated viruses), coronavirus, negative strand RNA viruses such as
orthomyxovirus (e.g., influenza virus), rhabdovirus (e.g., rabies and vesicular
stomatitis virus), paramyxovirus (e.g., measles and Sendai), positive strand RNA
viruses such as picornavirus and alphavirus, and double stranded DNA viruses
including adenovirus, herpesvirus (e.g., Herpes Simplex virus types 1 and 2,
Epstein-Barr virus, cytomegalovirus), and poxvirus (e.g., vaccinia, fowlpox and
canarypox). Other viruses include Norwalk virus, togavirus, flavivirus, reoviruses,
papovavirus, hepadnavirus, and hepatitis virus, for example. Examples of
retroviruses include: avian leukosis-sarcoma, mammalian C-type, B-type viruses,
D-type viruses, HTLV-BLV group, lentivirus, spumavirus (Coffin, J.M., Retroviridae:
The viruses and their replication, in Fundamental Virology, Third Edition, B.N.
Fields, et al., Eds., Lippincott-Raven Publishers, Philadelphia, 1996). Other
examples include murine leukemia viruses, murine sarcoma viruses, mouse
mammary tumor virus, bovine leukemia virus, feline leukemia virus, feline sarcoma
virus, avian leukemia virus, human T-cell leukemia virus, baboon endogenous virus,
Gibbon ape leukemia virus, Mason Pfizer monkey virus, simian immunodeficiency
virus, simian sarcoma virus, Rous sarcoma virus, lentiviruses and baculoviruses.

A preferred method to introduce nucleic acid that encodes h1, h2 or h3 CED-6
into cells is through the use of engineered viral vectors. These vectors provide a
means to introduce nucleic acids into cycling and quiescent cells, and have been
modified to reduce cytotoxicity and to improve genetic stability. The preparation
and use of engineered Herpes simplex virus type 1 (P.M. Krisky, et al., Gene
Therapy 4(10):1120-1125 (1997)), adenoviral (A. Amalfitano, et al., Journal of
Virology 72(2):926-933 (1998)), attenuated lentiviral (R. Zufferey, et al., Nature
Biotechnology 15(9):871-875 (1997)) and adenoviral/retroviral chimeric (M. Feng, et
al., Nature Biotechnology 15(9):866-870 (1997)) vectors are known to the skilled
artisan.

Hence, the claimed invention encompasses various therapeutic uses as
aforesaid for the h1, h2 or h3 CED-6 protein or nucleic acid.

The protein may be administered using methods known in the art. For
example, the mode of administration is preferably at the location of the target cells.
As such, the administration can be nasal (as in administering a vector expressing
ADA) or by injection (as in administering a vector expressing a suicide gene into a
tumor). Other modes of administration (parenteral, mucosal, systemic, implant,
intraperitoneal, etc.) are generally known in the art. The agents can, preferably, be
administered in a pharmaceutically acceptable carrier, such as saline, sterile water,
Ringer's solution, and isotonic sodium chloride solution.

The invention also provides diagnostic reagents which may be used in the
diagnosis of a disease associated with a defect in phagocytosis of apoptotic cells.
For example, an antibody to an epitope of any of the proteins with an amino acid
sequence as shown in SEQ ID Nos: 2, 4, 6, 8, 10, 12, 14 or 16 could be used as a
diagnostic reagent to determine whether a patient has a defect in h1CED-6, h2CED-6
or h3CED-6 or in the expression thereof. In addition, defects at the genetic level
can be detected by using as a probe a nucleic acid having a sequence as shown in
SEQ ID Nos: 1, 3, 5, 7, 9, 11, 13, or 15 or portions thereof.

Identification of the other proteins active in the CED-6 signal transduction pathway

CED-6, h1CED-6, h2CED-6 or h3CED-6 can be used to identify other
members of the signal transduction pathway promoting phagocytosis of apoptotic
cells. There are a number of possible methods by which this can be done, but a
preferred method is the so-called "two hybrid" system developed in yeast by Chien
et al. (1991, Proc. Natl. Acad. Sci. USA 88, pp 9578-9582), which allows
identification of proteins which bind to a particular protein of interest.

This technique is based on functional in vivo reconstruction of a transcription
factor which activates a reporter gene. More particularly the technique comprises
providing an appropriate host cell, preferably yeast, with a DNA construct
comprising a reporter gene under the control of a promoter regulated by a
transcription factor having a DNA binding domain and an activating domain;
expressing in the host cell a first hybrid DNA sequence encoding a first fusion of a
fragment or all of a nucleic acid sequence according to the invention and either said
DNA binding domain or the activating domain of the transcription factor; expressing
in the host cell at least one second hybrid DNA sequence encoding putative binding
proteins to be investigated together with the DNA binding domain or activating
domain of the transcription factor which is not incorporated in the first fusion;
detecting any binding of the protein being investigated with a protein according to
the invention by detecting the production of any reporter gene product in the host
cell; and optionally isolating the second hybrid DNA sequence encoding the binding
protein.
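
The logic of the two-hybrid screen can be illustrated with a toy model. This is a sketch under stated assumptions only: the candidate names and the interaction table are hypothetical placeholders and not results of this work. The single rule taken from the description is that the reporter gene is expressed only when the fusion carrying the DNA binding domain and the fusion carrying the activating domain bind each other.

```python
# Toy model of a two-hybrid screen. The candidate names and the interaction
# table are hypothetical placeholders, not data from this work.
from typing import FrozenSet, Iterable, List, Set


def reporter_active(bait: str, prey: str,
                    interactions: Set[FrozenSet[str]]) -> bool:
    """Reporter is transcribed only if the bait (fused to the DNA binding
    domain) binds the prey (fused to the activating domain), bringing the
    two halves of the transcription factor together."""
    return frozenset((bait, prey)) in interactions


def screen(bait: str, library: Iterable[str],
           interactions: Set[FrozenSet[str]]) -> List[str]:
    """Return the library clones that switch the reporter on."""
    return [prey for prey in library
            if reporter_active(bait, prey, interactions)]


# Hypothetical example: one of two library clones binds the bait.
known = {frozenset(("CED-6", "candidate_A"))}
hits = screen("CED-6", ["candidate_A", "candidate_B"], known)
print(hits)
```

In a real screen the interaction table is of course unknown; the reporter read-out is what reveals it, and the positive clones are then isolated and sequenced.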

EXAMPLES 

The N2 Bristol strain was used as the reference wild-type strain for this
study. All strains were maintained as described by Brenner (Brenner, 1974), except
that worms were raised on NGM-lite agar medium. Strains were maintained and
raised at 20°C, unless otherwise noted. The following mutations were used in this
study: LG I: ced-1(e1735), ced-1(n1995) and ced-1(n1506) (Ellis et al., 1991); LG
III: dpy-17(e164), ced-6(n1813, n2095), mec-14(u55), ncl-1(e1865), ced-7(n1997),
ced-7(n1892), ced-7(n1996) (Ellis et al., 1991), unc-36(e251) (Brenner, 1974) and
sDp3(III;f) (Rosenbluth et al., 1985); on LG IV: ced-2(e1752) (Hedgecock et al.,
1983), ced-5(n1812) and ced-10(n1993) (Ellis et al., 1991). All mutations are
described in Hodgkin (1997).

EXAMPLE 1 

Analysis and Quantification of Engulfment

Animals were anesthetized with 30 mM NaN3 and mounted on agar pads for
observation using Nomarski optics (Sulston & Horvitz, 1977; Avery
and Horvitz, 1987). To quantify engulfment of cell corpses generated during
embryonic development, the number of persistent cell corpses that were visible in
the head region of young L1 larvae that still had only four cells in the gonad (i.e., had
hatched in the previous four hours) was scored. To quantify the germ line
engulfment defect, cell corpses visible within both the distal arm (where the germ
cell deaths occur) and the proximal arm (where persistent germ cell corpses can
sometimes be observed as they are swept along by the developing oocytes) were
counted.

EXAMPLE 2 

Germline Transformation and Genomic Rescue of ced-6

Transgenic animals were generated using the germline microinjection
procedure developed by Mello et al. (1991). Cosmids W03A5, F20F10, F48E8, R02F2,
W02G12, T06H6, C48E6, C44D7, F56D2, F43F12, C05D2, T06C9, C05H8 were
injected, either singly or in groups (final concentration 20 ng/µl for each cosmid),
into ced-6(n1813) animals. Plasmid pRF4 was used (final concentration 50-80
ng/µl) as the dominant co-injection marker (Mello et al., 1991); pRF4 carries the
mutated collagen gene rol-6(su1006gf) and confers a dominant roller (Rol)
phenotype. Transgenic lines carrying stably transmitting extrachromosomal arrays
were kept for further analysis. To assay for rescue, three-fold embryos laid by
transgenic animals were examined for cell corpses under Nomarski optics.
Transgenic lines that generated embryos with fewer or no corpses were considered
to be rescued. To further define the position of ced-6 within F56D2, a number of
deletion constructs were created and other fragments subcloned into pBluescript
II SK(+). 50-90 ng/µl of these clones were co-injected with 80-100 ng/µl pRF4
injection marker into ced-6(n1813) worms, and their rescuing ability tested as
described above.
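
Because the injection mixes are specified by final concentration, preparing one reduces to the dilution relation C1 x V1 = C2 x V2. The following is a minimal sketch of that arithmetic. The stock concentrations and the 10 µl total volume are hypothetical values chosen for illustration; only the final concentrations (20 ng/µl for a cosmid, 80 ng/µl for pRF4) are taken from the text.

```python
def stock_volume(final_ng_per_ul, total_ul, stock_ng_per_ul):
    """Volume of stock (in µl) needed to reach the final concentration."""
    if stock_ng_per_ul < final_ng_per_ul:
        raise ValueError("stock is more dilute than the requested final concentration")
    return final_ng_per_ul * total_ul / stock_ng_per_ul


def injection_mix(total_ul, components):
    """components maps name -> (final ng/µl, stock ng/µl).
    Returns volumes in µl, with water making up the remainder."""
    volumes = {name: stock_volume(final, total_ul, stock)
               for name, (final, stock) in components.items()}
    water = total_ul - sum(volumes.values())
    if water < 0:
        raise ValueError("components exceed the total volume")
    volumes["water"] = water
    return volumes


# Hypothetical stocks: cosmid at 200 ng/µl, pRF4 at 400 ng/µl, 10 µl mix.
mix = injection_mix(10.0, {"cosmid F56D2": (20.0, 200.0),
                           "pRF4": (80.0, 400.0)})
print(mix)
```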

EXAMPLE 3

Isolation of ced-6 cDNAs

To isolate full-length ced-6 cDNAs, a mixed-stage C. elegans lambda Zap
cDNA library (gift of R. Barstead, Oklahoma Medical Research
Foundation, Oklahoma City, OK) was screened using established protocols
(Sambrook et al., 1989). A 32P-labeled probe was made using the rescuing 10 kb
Xho I genomic fragment as template. Positive phage were converted into plasmid
clones using the in vivo excision protocol. The clones representing the F56D2.7
gene among the isolated plasmid clones were identified on a Southern blot. For
this purpose a 32P-labeled probe was generated from an RT-PCR product, which
represents three exons of the predicted F56D2.7. Primers used for RT-PCR:
GAATGTTCTCATTTATTG (SEQ ID NO.: 29) and GGATTCAAACGATCCGATG
(SEQ ID NO.: 17).
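
Basic properties of the two RT-PCR primers can be checked with a short script. This is an illustrative sketch only: the primer sequences are those listed above, while the Wallace rule used for the melting temperature is a rough rule of thumb introduced here for illustration and is not a method described in this work.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq):
    """Reverse complement of a DNA sequence (A, C, G, T only)."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def gc_count(seq):
    """Number of G and C bases in the sequence."""
    s = seq.upper()
    return s.count("G") + s.count("C")


def wallace_tm(seq):
    """Rough melting temperature (°C) by the Wallace rule:
    2 * (A + T) + 4 * (G + C). Only a coarse estimate for short oligos."""
    gc = gc_count(seq)
    return 2 * (len(seq) - gc) + 4 * gc


# Primer sequences as listed in the text.
PRIMERS = {
    "SEQ ID NO.: 29": "GAATGTTCTCATTTATTG",
    "SEQ ID NO.: 17": "GGATTCAAACGATCCGATG",
}

for name, seq in PRIMERS.items():
    print(name, len(seq), gc_count(seq), wallace_tm(seq),
          reverse_complement(seq))
```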

From about 300,000 plaques, 10 plasmid clones corresponding to F56D2.7
cDNAs were isolated. These clones were sequenced from both ends of the insert
using the flanking T3 and T7 primers. Two clones with partial SL2 sequence at the
5' end and an intact poly(A) tail were identified as full-length F56D2.7 cDNAs.
Analysis of these sequence results and the pattern of restriction digestion by Sau3A
I also suggested that these clones represent a single transcript.

EXAMPLE 4

Reverse transcription-PCR

Reverse transcription (RT)-PCR experiments were performed to determine
the 5' end of transcripts detected or predicted within the rescuing Xho I genomic
fragment. Reverse transcription was performed with the following primers: C05D2.6a:
GAATCTGTCCATCGCATTGC (SEQ ID NO.: 18),
GAATTTCTTTGGGTAGACA (SEQ ID NO.: 19); C05D2.6b:
GCTCTGAAGAACTGTGA (SEQ ID NO.: 20), GACGAGGTGAAGCGATTGTG
(SEQ ID NO.: 21); F56D2.7: GGGATCAAACGAATCATC (SEQ ID NO.: 22).
These primers were then used in combination with SL1
(GTTTAATTACCCAAGTTTGAG (SEQ ID NO.: 23)) or SL2
(GGTTTTAACCCAGTTACTCAAG (SEQ ID NO.: 24)) primers for subsequent
PCR amplification. Total C. elegans mixed stage RNA was isolated as described
previously. RT-PCR was performed using the SuperScript Preamplification System
(Gibco BRL).

EXAMPLE 5

Identification of ced-6 Mutations

To determine whether either ced-6 allele resulted in a large physically
detectable polymorphism, we generated Southern blots of N2, ced-6(n1813), and
ced-6(n2095) genomic DNA digested with various restriction enzymes. A probe
generated from the rescuing Xho I genomic fragment detected novel allele-specific
bands in ced-6(n2095) using four different restriction enzymes. Analysis of the
novel restriction patterns in ced-6(n2095) indicates that this allele carries a complex
rearrangement in this region that covers at least part of F56D2.7, but does not affect
the neighboring C05D2.6b transcript.

To identify point mutations within F56D2.7, overlapping fragments of the
F56D2.7 locus from N2, ced-6(n1813), and ced-6(n2095) mutants were PCR
amplified and directly sequenced using the PCR Product Sequencing Kit
(Amersham). The overlapping PCR fragments covered the entire F56D2.7
transcription unit and about 1 kb of upstream genomic sequence. Sequences of the
primers used for PCR amplification and sequencing are available upon request.
EXAMPLE 6

Heat Shock Experiments

To test whether ced-6 cDNA can rescue the engulfment defect, a Kpn I/Sal I
fragment of full-length F56D2.7 cDNA was inserted in the Kpn I/Sac I site of MCS II
of both pPD49.78 and pPD49.83 vectors, which carry the hsp16-2 and hsp16-41
promoters, creating the constructs pLQhs1 and pLQhs2. The two constructs were
co-injected, at 50 ng/µl each with 80 ng/µl pRF4, to generate stably transmitting
extrachromosomal arrays. For our control experiments, we used pPD50.21 and
pPD50.15, two derivatives of pPD49.78 and pPD49.83 in which the lacZ open reading
frame has been placed under heat shock promoters. Transgenic lines carrying these
constructs were generated as described above.

To overexpress ced-6 before cell death occurs during embryonic
development, adult animals were put on a plate seeded with E. coli and allowed to
lay eggs for one hour. Plates were subsequently parafilmed and subjected to heat
shock by transfer to a 33°C water bath for 45 minutes. Following a 75-minute recovery
at 20°C, adult animals were removed from the plates. 12-14 hours after heat shock,
hatching L1 larvae were scored for corpses in the head region.

To overexpress ced-6 after the formation of cell corpses during embryonic
development, worm plates containing embryos at all developmental stages (but not
larvae) were parafilmed and subjected to heat shock in a 33°C water bath for 45
minutes. Three hours after the heat shock, freshly hatched L1 larvae were scored for
corpses in the head region.

To determine the effect of ced-6 overexpression before cell death occurs on
the engulfment of dying germ cells, L4 stage transgenic animals were transferred to
new plates and stored at 20°C. Starting 24 hours after the L4 molt, the worm plates
were parafilmed and heat shocked for 45 minutes at 33°C as described above.
Animals were examined for germ cell corpses at 12, 18, 24, 36, and 60 hours after
heat shock.

To overexpress ced-6 after the formation of germ cell corpses, L4 stage
transgenic animals were collected and distributed over several plates, a few animals
per plate. 24 hours after the L4 molt, one plate of worms was heat shocked for 45
minutes as described above. Similarly, at 36, 42, 48 and 60 hours after the L4 molt,
one plate of worms per time point was heat shocked. Animals were examined for germ
cell corpses 12 hours after heat shock.

To overexpress ced-6 in the background of other engulfment mutants, the
ced-6 or lacZ-expressing extrachromosomal arrays were transferred from
ced-6(n1813) to a wild-type background, and crossed subsequently to ced-1(e1735),
ced-1(n1506), ced-1(n1995), ced-7(n1892), ced-7(n1996), ced-7(n1997),
ced-2(e1752), ced-5(n1812) or ced-10(n1993) to generate the corresponding
transgenic mutant strains. Heat shock experiments were performed as described
above.

EXAMPLE 7 

Genetic Mosaic Analysis 

1000 dpy-17(e164) ced-6(n1813) mec-14(u55) ncl-1(e1865) unc-36(e251) III;
sDp3(III;f) animals were placed on worm plates individually. The progeny of these
animals were examined under the dissecting microscope to identify animals that
laid only Dpy Unc progeny. The adult animals were examined under Nomarski optics
immediately after being identified. First the somatic sheath cells were examined,
then the body wall muscle descended from the D and C lineages. When all body wall
muscle cells were wild-type, the duplication had been lost in the P4 lineage. When
body wall muscle cells from the D lineage exhibited the ncl phenotype, while those
from the C lineage were wild-type, the duplication must have been lost in the P3
lineage. When body wall muscle cells from both the D and C lineages showed the ncl
phenotype, the duplication must have been lost in the P2 lineage. The cell corpses
in both arms of the gonad were also examined for the engulfment phenotype. To find
animals with the duplication lost in the somatic sheath cells, but not in germ
cells, dpy-17(e164) ced-6(n1813) mec-14(u55) ncl-1(e1865) unc-36(e251) III;
sDp3(III;f) animals were examined under Nomarski optics for the loss of the
duplication in somatic sheath cells. At the same time cell corpses in the gonad
were also examined for the engulfment phenotype.
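
The inference of where the duplication was lost can be written as a small lookup over the early germline lineage. This is a sketch under a stated assumption that is not spelled out in the text: the standard early C. elegans lineage, in which P2 divides into C and P3, P3 divides into D and P4, and P4 gives rise to the germ line. The function and variable names are illustrative only.

```python
# Assumed early lineage (not stated in the text): P2 -> C + P3,
# P3 -> D + P4, and P4 gives rise to the germ line.
CHILDREN = {"P2": ["C", "P3"], "P3": ["D", "P4"], "P4": ["germline"]}


def descendants(cell):
    """Terminal lineages derived from `cell` in the assumed tree."""
    kids = CHILDREN.get(cell)
    if kids is None:
        return {cell}
    out = set()
    for kid in kids:
        out |= descendants(kid)
    return out


def infer_loss_point(d_is_ncl, c_is_ncl):
    """For an animal whose germ line has lost the duplication, return the
    P cell in which a single loss event explains the ncl phenotype seen in
    D- and C-derived body wall muscle, or None if no single loss in
    P2, P3 or P4 fits the observation."""
    observed = {"germline"}
    if d_is_ncl:
        observed.add("D")
    if c_is_ncl:
        observed.add("C")
    for cell in ("P4", "P3", "P2"):
        if descendants(cell) == observed:
            return cell
    return None


print(infer_loss_point(d_is_ncl=False, c_is_ncl=False))
print(infer_loss_point(d_is_ncl=True, c_is_ncl=False))
print(infer_loss_point(d_is_ncl=True, c_is_ncl=True))
```

A pattern in which C-derived muscle is ncl but D-derived muscle is wild-type returns None, since under the assumed tree it cannot arise from a single loss that also removes the duplication from the germ line.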



EXAMPLE 8

Identification of a human homologue of CED-6

Extensive searches (tblastn) with the ced-6 sequence (Figure 18, Consensus
DNA Sequence of hCED-6) against the public domain databases (EST, Genbank,
EMBL, Swissprot and PIR) revealed statistically significant homologies to some
ESTs at the carboxy-terminal region of the protein (AA443368, AA431995,
R33389, R53881). One EST (T48513) showed homology to the carboxy terminus of
the PTB domain and the beginning of the charged region. For 5' RACE analyses a
Marathon-Ready colorectal adenocarcinoma cDNA library from Clontech was used.
The position of the primers used for RACE and sequencing is indicated in Figure
18. By subsequent cloning and sequence analysis additional sequence information
was obtained. Subsequent rounds of database searching (blastn) using this
additional sequence information revealed additional ESTs, which enabled us to
construct a consensus of approx. 2400 bp. This sequence was further extended and
verified by colony hybridization and sequencing of additional RACE products.

EXAMPLE 9

RNA Blots (see Figure 25: expression pattern of hCED-6 in normal human
tissues and cancer cell lines by Northern blotting. A) Human Multiple Tissue
Northern (MTN) Blot; B) Human Multiple Tissue Northern (MTN) Blot II; C)
Human Cancer Cell Line Multiple Tissue Northern (MTN™) Blot)

A Human multiple tissue Northern (MTN-I, Clontech) containing in each
lane 2 µg of poly A+ RNA from eight different human tissues (heart, brain,
placenta, lung, liver, skeletal muscle, kidney, and pancreas) and a MTN-II human
multiple tissue Northern, containing in each lane 2 µg of poly A+ RNA from
spleen, thymus, prostate, testis, ovary, small intestine, colon and peripheral
leukocyte, were hybridized according to the manufacturer's instructions and washed
out in 0.1x SSC, 0.2% SDS at 55°C. Also from Clontech, a poly A+ RNA blot from
human cancer cell lines (melanoma G361, lung carcinoma A549, colorectal
adenocarcinoma SW480, Burkitt's lymphoma Raji, leukemia Molt 4, lymphoblastic
leukemia K562, HeLa S3 and promyelocytic leukemia HL60) was tested.
EXAMPLE 10

Isolation of the full-length human ced-6 cDNA

Several human EST clones including hbc3123 have been identified by
searching a variety of databases. The hbc3123 EST clone was completely sequenced.
One pair of primers, P (ACAATTGCCAGCTTCATAG; SEQ ID NO.: 30) and Q
(CTGTTTTCTTGTTTCAACATC; SEQ ID NO.: 31), was designed in the
region of the PTB domain and subsequently tested for specificity using human
genomic DNA as a template. The result showed that the primers are specific. One
set of λgt10 cDNA libraries (purchased from Clontech) including Brain, Heart,
Kidney, Liver, Lung, Pancreas, Placenta and Skeletal Muscle tissues was tested using
primers P and Q to detect whether ced-6 is expressed in any of these tissues.

The primer Q and a primer against the λgt10 vector were used to isolate several
PCR fragments using brain and pancreas cDNA libraries. These PCR fragments
were reamplified using the same primer set and sequenced. The sequence analysis
suggested that these PCR fragments allow the extension of the cDNA 130 bp upstream
of the initiation codon of the human ced-6 coding region. The longest PCR fragment
was then used to search the human EST database for more EST clones which
overlap with the isolated PCR fragments but not with the hbc3123 EST clone. The
Genbank names of these three EST clones are R65982, R65983 and AA159394.
These 3 ESTs together with the PCR fragment and hbc3123 constitute
the full-length coding sequence of human CED-6 and about 450 bp of 5' UTR. The
human ced-6 cDNA sequence is confirmed by the sequencing data of the
hbc3123 EST clone, the sequencing data of the isolated PCR fragments and the
sequence data of the many EST clones covering the human cDNA region from the
human EST project. These human ced-6 cDNA data have suggested and guided the
experiments shown in both Example 8 and Example 9. See Figure 32.
EXAMPLE 11

Human Tissue Distribution of Human CED-6

This is a further example of the human tissue distribution. Two primers
against the PTB domain were used to detect whether the cDNA libraries contained
human ced-6. The two primers have been tested using human genomic DNA as a
template and they are specific since no background amplification was detected. The
result of this tissue distribution study is as follows:

I. Information obtained from cDNA library

Tissue              Presence of human ced-6 cDNA

Brain               ++
Heart               +++
Kidney
Liver               +
Lung                ++
Pancreas
Placenta            ++
Skeletal muscle     ++

II. Information obtained from human EST project

Tissue              EST clones from human EST project

Brain               2
Testis              3
Pancreas            4
HCC cell line       1
Aorta               1
Placenta            13
Fetus               1
Pooled sample       2

EXAMPLE 12 

Using the technique known as FISH, the human ced-6 gene was
localized to chromosomal position 2q32.3-q33.

EXAMPLE 13

Functional conservation between C. elegans and human ced-6 homologues;
overexpression of hCED-6 Rescues the Engulfment Defect of ced-6 Mutants in C.
elegans:

Given that signal transduction pathways are usually conserved through
evolution, it is thought that the human ced-6 homologue (hereafter referred to as
hced-6, which encompasses h1CED-6 and/or h3CED-6) might also be involved in
promoting the phagocytic removal of apoptotic cells in mammals. To address this
question, we tested the human and worm ced-6 genes for functional conservation by
overexpressing hced-6 in C. elegans and determining whether it could functionally
substitute for the endogenous ced-6 gene.

It is shown herein that overexpression of a C. elegans ced-6 cDNA under the
control of the heat shock promoters hsp16-2 and hsp16-41 efficiently rescues the
engulfment defect in transgenic ced-6 mutant embryos. The same assay was used to
test hced-6 for biological activity in C. elegans: constructs were created carrying the
hced-6 open reading frame under the control of hsp16-2 and hsp16-41, and
ced-6(n1813) mutant animals transgenic for both constructs were tested for rescue of the
engulfment defect in late embryos and young larvae. It was found that heat-shocked
embryos laid by transgenic mothers, but not non-heat shocked embryos, contained
few cell corpses (Figure 31A). These observations suggest that hced-6 can
substitute, albeit weakly in the current assay, for C. elegans ced-6, supporting the
concept that C. elegans and human ced-6 are functionally conserved. Further
assessment, as shown in Example 13, showed successful rescue.

Partial rescue, or even absence of rescue in certain assays, has been observed
previously, even in cases where functional conservation has been established. For
example, Wu and Horvitz (1998a), Nature 392:501-504, have found that
DOCK180, the mammalian homologue of C. elegans CED-5, efficiently rescued the
distal tip cell migration defect of ced-5 mutants, but not the engulfment defect.

Experimental Procedures 

The open reading frame of hced-6 was PCR-amplified using oligonucleotides
flanking the start and stop codons, and subcloned into the heat shock vectors
pPD49.78 and pPD49.83, previously digested with Kpn I and Sac I (see before). The
two constructs were then injected into ced-6(n1813) animals as described previously
to establish stably transmitting transgenic lines.

To score for rescue of the engulfment defect in embryos and in the adult germ
line, transgenic animals were submitted to heat shock and the number of cell corpses
quantified as described previously herein.

Table 2

Overexpression of human ced-6 homologue reduces the number of persistent
cell corpses in ced-6(n1813) late embryos.

Genotype                      Persistent cell corpses
                              - heatshock     + heatshock

Wild Type (N2)
ced-6(n1813)                  +++             ++
ced-6(n1813); hs::hced-6      +++


One of the isolated PCR fragments was fused to the hbc3123 EST clone.
pLQhced-6.1, the fusion cDNA, has 130 nucleotides upstream of the initiation codon
ATG. Two primers, Hhs1 (GGGGTACCGAATTCTGATGGCAAC; (SEQ ID
NO.: 27)) and Hhs3 (CGAGCTCGATCAATAGTGAAGGTGAGG; (SEQ ID NO.:
28)) were used to amplify the open reading frame of human ced-6 cDNA. The PCR
fragment was digested subsequently with Kpn I and Sac I, and inserted into the Kpn I
and Sac I sites of both pPD49.78 and pPD49.83 heat shock vectors. The heat shock
constructs, pLQhs1 and pLQhs2, 50 ng/µl for each, were then co-injected with a
marker pRF4 (80 ng/µl) into the germline of adult ced-6(n1813) hermaphrodites.
hced-6 was examined for its ability to rescue the engulfment defect in embryo
progeny of ced-6(n1813) transgenic animals following an established procedure, as
described herein.

The rescuing ability of hCED-6 for the engulfment defect of ced-6(n1813) in
the adult gonad was also tested. Transgenic animals at the L4/adult molt were picked
and put on a fresh plate. 36 hours later these animals were treated with a 45 minute
heat shock at 33°C. Twelve hours after the heat shock, cell corpses were scored in
one gonad arm. Control experiments, such as transgenic animals without heat
treatment and ced-6(n1813) animals at the same developmental stage with or without
heat shock, were also used. These experiments show that overexpression of hced-6
rescued the engulfment defect of ced-6 mutants in C. elegans in the germ line. These
experiments confirm that human ced-6 (e.g., h3CED-6) functions to induce the
phagocytosis of apoptotic cells. See Figures 31A and 31B.

EXAMPLE 14 

Sequences can be obtained in both deposits using T3 or T7 primers (either
one or both can be used; they lie on different sides of the actual insert). Both are
commercially available from Clontech (#1227 and #1228) and their sequences are
shown below:

T7 primer: 5'(TAATACGACTCACTATAGGGAGA)3' (SEQ ID NO.: 25)
T3 primer: 5'(ATTAACCCTCACTAAAGGGA)3' (SEQ ID NO.: 26)

CLAIMS



1. An isolated protein selected from the group consisting of:

a) a protein comprising the amino acid sequence as shown in
SEQ ID No. 2,

b) a protein comprising the amino acid sequence as shown in
SEQ ID No. 4,

c) a protein comprising the amino acid sequence as shown in
SEQ ID No. 6,

d) a protein comprising the amino acid sequence as shown in
SEQ ID No. 8,

e) a protein comprising the amino acid sequence as shown in
SEQ ID No. 10,

f) a protein comprising the amino acid sequence as shown in
SEQ ID No. 12,

g) a protein comprising the amino acid sequence as shown in
SEQ ID No. 14,

h) a protein comprising the amino acid sequence as shown in
SEQ ID No. 16,

i) a protein comprising an amino acid sequence which is at least
40% identical to the amino acid sequence of SEQ ID No. 2, 4,
6, 8, 10, 12, 14, or 16, and

j) a protein comprising an amino acid sequence encoded by the
nucleic acid sequence as shown in SEQ ID No's 1, 3, 5, 7, 9,
11, 13 or 15.



2. An isolated nucleic acid selected from the group consisting of:

a) a nucleic acid comprising the sequence of nucleotides shown
in SEQ ID No. 1,

b) a nucleic acid comprising the sequence of nucleotides shown
in SEQ ID No. 3,

c) a nucleic acid comprising the sequence of nucleotides shown
in SEQ ID No. 7,

d) a nucleic acid comprising the sequence of nucleotides shown
in SEQ ID No. 9,

e) a nucleic acid comprising the sequence of nucleotides shown
in SEQ ID No. 11,

f) a nucleic acid comprising the sequence of nucleotides shown
in SEQ ID No. 13,

g) a nucleic acid comprising the sequence of nucleotides shown
in SEQ ID No. 15,

h) a nucleic acid comprising the complement of the sequence of
nucleotides shown in SEQ ID Nos. 1, 3, 5, 7, 9, 11, 13, or 15,

i) a nucleic acid capable of hybridizing to a nucleic acid
according to (h) under conditions of low stringency,

j) a nucleic acid which encodes an amino acid sequence which is
at least 40% identical to the amino acid sequences of SEQ ID
Nos. 2, 4, 6, 8, 10, 12, 14 or 16, and

k) a nucleic acid which is at least 40% identical to the nucleic
acid sequences of SEQ ID Nos. 1, 3, 5, 7, 9, 11, 13 or 15.

3. An expression vector comprising a nucleic acid selected from the
group consisting of:

a) a nucleic acid comprising the sequence of nucleotides
shown in SEQ ID No. 1, 3, 5, 7, 9, 11, 13, or 15,

b) a nucleic acid comprising the complement of the
sequence of nucleotides shown in SEQ ID No. 1, 3, 5,
7, 9, 11, 13, or 15,

c) a nucleic acid capable of hybridizing to a nucleic acid
according to (b) under conditions of low stringency,

d) a nucleic acid which encodes an amino acid sequence
which is at least 40% identical to an amino acid
sequence of SEQ ID Nos. 2, 4, 6, 8, 10, 12, 14, or 16,
and

e) a nucleic acid which encodes an amino acid sequence
of SEQ ID Nos. 2, 4, 6, 8, 10, 12, 14, or 16.



4. The expression vector of claim 3, comprising DNA encoding a
reporter gene positioned in said vector such that expression of said
nucleic acid results in expression of said reporter gene.

5. The expression vector of claim 4, wherein said reporter gene encodes
green fluorescent protein.

6. A mammalian cell-line transfected with a nucleic acid selected from
the group consisting of:

a) a nucleic acid comprising the sequence of nucleotides
shown in SEQ ID No. 1, 3, 5, 7, 9, 11, 13, or 15,

b) a nucleic acid comprising the complement of the
sequence of nucleotides shown in SEQ ID No. 1, 3, 5,
7, 9, 11, 13, or 15,

c) a nucleic acid capable of hybridizing to a nucleic acid
according to (b) under conditions of low stringency,

d) a nucleic acid which encodes an amino acid sequence
which is at least 40% identical to an amino acid
sequence of SEQ ID Nos. 2, 4, 6, 8, 10, 12, 14, or 16,
and

e) a nucleic acid which encodes an amino acid sequence
of SEQ ID Nos. 2, 4, 6, 8, 10, 12, 14, or 16.



7. A mammalian cell-line as claimed in claim 6, which is transfected
with an expression vector comprising said nucleic acid and a reporter
gene, said reporter gene being positioned in said vector such that
expression of said nucleic acid results in expression of said reporter
gene.



8. A mammalian cell-line as claimed in claim 7, wherein said reporter 
gene encodes green fluorescent protein. 



9. A mammalian cell-line as claimed in claim 6, wherein said cell-line is
selected from the group consisting of: a fibroblast cell-line, and an
epithelial cell-line.



10. A mammalian cell-line as claimed in claim 6, wherein said cell-line is
selected from the group consisting of: COS1, BHK21, L929, CV1,
Swiss 3T3, HT144, IMR32, HEPG2, MDCK, MCF7, 293, HeLa,
A549, SW48 and G361.



11. A mammalian cell-line as claimed in claim 10, which is an MCF7
cell-line.

12. A mammalian cell-line as claimed in claim 7, which is an MCF7
cell-line.

13. A mammalian cell-line as claimed in claim 6, wherein said cell-line is
a primary cell-line.

14. A mammalian cell-line as claimed in claim 13, wherein said cell-line
is selected from the group consisting of: human dermal FDBs, dermal
keratinocytes, leucocytes, monocytes, and macrophages.

15. A non-human transgenic animal comprising a gene encoding a
protein, the protein selected from the group consisting of:

a) a protein comprising the amino acid sequence as shown in
SEQ ID No. 2,

b) a protein comprising the amino acid sequence as shown in
SEQ ID No. 4,

c) a protein comprising the amino acid sequence as shown in
SEQ ID No. 6,

d) a protein comprising the amino acid sequence as shown in
SEQ ID No. 8,

e) a protein comprising the amino acid sequence as shown in
SEQ ID No. 10,

f) a protein comprising the amino acid sequence as shown in
SEQ ID No. 12,

g) a protein comprising the amino acid sequence as shown in
SEQ ID No. 14,

h) a protein comprising the amino acid sequence as shown in
SEQ ID No. 16,

i) a protein comprising an amino acid sequence which is at least
40% identical to the amino acid sequence of SEQ ID No. 2, 4,
6, 8, 10, 12, 14, or 16, and

j) a protein comprising an amino acid sequence encoded by the
nucleic acid sequence as shown in SEQ ID No's 1, 3, 5, 7, 9,
11, 13 or 15.

A transgenic nematode worm which is lacking all or substantially all 
of the function of its native CED-6 gene which has been transfected or 
transformed with a nucleic acid selected from the group consisting of: 

a) a nucleic acid comprising the sequence of nucleotides shown 
in SEQ ID No. 7, 9, 11, 13, or 15, 

b) a nucleic acid comprising the complement of the sequence of 
nucleotides shown in SEQ ID Nos. 7, 9, 1 1, 13, or 15. 

c) a nucleic acid capable of hybridizing to a nucleic acid 
according to (b) under conditions of low stringency, 

d) a nucleic acid encoding an amino acid sequence which is at 
least 40% identical to the amino acid sequences of SEQ ID 
Nos. 8, 10, 12, 14, or 16, and 

e) a nucleic acid which encodes an amino acid sequence of SEQ 
ID Nos. 8, 10, 12, 14, or 16. 
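
For illustration, the complement of a nucleotide sequence referred to in part (b) above can be derived mechanically from the sequence itself. The Python sketch below shows both the position-by-position complement and the reverse complement (the complementary strand read 5' to 3'); which of the two is meant in a given context is a matter of convention and is treated here as an assumption. The function names and the short example fragment are hypothetical.

```python
# Translation table for the four DNA bases plus the ambiguity code N,
# in upper and lower case.
_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def complement(seq: str) -> str:
    """Return the base-by-base complement, in the same orientation."""
    return seq.translate(_COMPLEMENT)


def reverse_complement(seq: str) -> str:
    """Return the complementary strand written 5' to 3'."""
    return complement(seq)[::-1]


if __name__ == "__main__":
    fragment = "GGTGATGAGC"  # arbitrary 10-base example
    print(complement(fragment))          # CCACTACTCG
    print(reverse_complement(fragment))  # GCTCATCACC
```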

The transgenic nematode worm of claim 16 which is C. elegans.

A method for determining whether a compound is an inhibitor or an 
enhancer of a signal transduction pathway which promotes 
phagocytosis of apoptotic cells which method comprises exposing a 
transgenic mammalian cell transfected with a nucleic acid selected 
from the group consisting of: 

a) a nucleic acid comprising the sequence of nucleotides shown
in SEQ ID No. 1, 3, 5, 7, 9, 11, 13, or 15,

b) a nucleic acid comprising the complement of the sequence of 
nucleotides shown in SEQ ID No. 1, 3, 5, 7, 9, 11, 13, or 15,

c) a nucleic acid capable of hybridizing to a nucleic acid 
according to (b) under conditions of low stringency, 

d) a nucleic acid which encodes an amino acid sequence which is at least
40% identical to the amino acid sequences of SEQ ID Nos. 2, 
4, 6, 8, 10, 12, 14, or 16, and 

e) a nucleic acid which encodes an amino acid sequence of SEQ 
ID Nos. 2, 4, 6, 8, 10, 12, 14, or 16, 

to apoptotic particles and measuring the rate of phagocytic uptake of 
said particles by said transgenic cells in the presence and absence of 
said compound, wherein an increased rate of phagocytosis indicates 
an enhancer and a decreased rate of phagocytosis indicates an 
inhibitor. 
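
The comparison recited in the preceding method can be sketched in a few lines of Python. The sketch assumes that the rate of phagocytic uptake is expressed as particles engulfed per cell per hour and that a minimum relative change (10% here) is required before a compound is called an enhancer or an inhibitor; neither the unit nor the cut-off is specified above, and both are illustrative assumptions, as are the function names `uptake_rate` and `classify_compound`.

```python
def uptake_rate(particles_engulfed: int, cells_counted: int, hours: float) -> float:
    """Return uptake as particles engulfed per cell per hour (assumed unit)."""
    if cells_counted <= 0 or hours <= 0:
        raise ValueError("cells_counted and hours must be positive")
    return particles_engulfed / cells_counted / hours


def classify_compound(
    rate_with_compound: float,
    rate_without_compound: float,
    min_relative_change: float = 0.10,  # assumed cut-off, not from the claims
) -> str:
    """Compare uptake rates in the presence and absence of a compound."""
    if rate_without_compound <= 0:
        raise ValueError("control rate must be positive")
    change = (rate_with_compound - rate_without_compound) / rate_without_compound
    if change > min_relative_change:
        return "enhancer"
    if change < -min_relative_change:
        return "inhibitor"
    return "no clear effect"


if __name__ == "__main__":
    control = uptake_rate(particles_engulfed=100, cells_counted=100, hours=1.0)
    treated = uptake_rate(particles_engulfed=150, cells_counted=100, hours=1.0)
    print(classify_compound(treated, control))
```

In practice the cut-off would be replaced by a statistical test over replicate measurements; the fixed threshold is used here only to keep the example short.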

A method as claimed in claim 18, wherein said apoptotic particles are
selected from the group consisting of: opsonized apoptotic 
neutrophils, opsonized apoptotic lymphocytes, opsonized apoptotic 
erythrocytes, opsonized killed bacteria and opsonized killed yeast. 

A method as claimed in claim 19, wherein said apoptotic particles are 
labelled. 

A method as claimed in claim 20, wherein said label is selected from 
the group consisting of: a non-fluorescent dye, a fluorescent dye, a
non-fluorescent dye linked to an antibody and a fluorescent dye linked
to an antibody. 

A method as claimed in claim 18, wherein the transgenic mammalian 
cell is a fibroblast cell or an epithelial cell. 

A method as claimed in claim 22, wherein the transgenic mammalian 
cell is selected from the group consisting of: COS1, BHK21, L929,
CV1, SWISS 3T3, HT144, IMR32, HEPG2, MDCK, MCF7, 293,
Hela, A549, SW48 and G361.

A method as claimed in claim 18, wherein said transgenic mammalian 
cell is a primary cell. 

A method as claimed in claim 24, wherein said transgenic mammalian 
cell is selected from the group consisting of: human dermal FIBs, 
dermal keratinocytes, leucocytes, monocytes and macrophages.

A method as claimed in claim 21, wherein the phagocytosed apoptotic 
particles are detected by a method selected from the group consisting 
of: light microscopy, fluorescence microscopy, quantitative 
spectrofluorometry, and flow cytometry.

A compound identified by the method of claim 18, as an inhibitor or 
an enhancer of a signal transduction pathway which promotes 
phagocytosis of apoptotic cells. 

A method for determining whether a compound is an inhibitor or an
enhancer of a signal transduction pathway which promotes 
phagocytosis of apoptotic cells which comprises: 

(1) introducing into a mammalian cell a protein selected from the 
group consisting of:

a) a protein comprising an amino acid sequence as shown
in SEQ ID No. 2, 4, 6, 8, 10, 12, 14, or 16,

b) a protein comprising an amino acid sequence which is
at least 40% identical with SEQ ID Nos. 2, 4, 6, 8, 10,
12, 14, or 16,

c) a protein comprising an amino acid sequence encoded
by the sequence of nucleotides shown in SEQ ID Nos.
1, 3, 5, 7, 9, 11, 13, or 15; or

introducing into a mammalian cell a vector expressing RNA
antisense that inhibits transcription of a protein in one of the
groups a) to c) above, and

(2) exposing the mammalian cell to apoptotic particles and 

measuring the rate of phagocytic uptake of said particles by 
said cell in the presence or absence of said compound wherein 
an increased rate of phagocytosis indicates an enhancer and a 
decreased rate of phagocytosis indicates an inhibitor. 

A method as claimed in claim 28, wherein said apoptotic particles are 
selected from the group consisting of: opsonized apoptotic 
neutrophils, opsonized apoptotic lymphocytes, opsonized apoptotic
erythrocytes, opsonized killed bacteria, and opsonized killed yeast. 

30. A method as claimed in claim 28, wherein said apoptotic particles are
labelled.

31. A method as claimed in claim 30, wherein said label is selected from
the group consisting of: a non-fluorescent dye, a fluorescent dye, a
non-fluorescent dye linked to an antibody, and a fluorescent dye
linked to an antibody.

32. A method as claimed in claim 28, wherein the mammalian cell is a
fibroblast cell or an epithelial cell.

33. A method as claimed in claim 32, wherein the mammalian cell is
selected from the group consisting of: COS1, BHK21, L929, CV1,
SWISS 3T3, HT144, IMR32, HEPG2, MDCK, MCF7, 293, Hela,
A549, SW48 and G361.

34. A method as claimed in claim 28, wherein said mammalian cell is a
primary cell.

35. A method as claimed in claim 34, wherein said mammalian cell is
selected from the group consisting of: human dermal FIBs, dermal
keratinocytes, leucocytes, monocytes, and macrophages.

36. A method as claimed in claim 30, wherein the phagocytosed apoptotic
particles are detected by a method selected from the group consisting
of: light microscopy, fluorescence microscopy, quantitative
spectrofluorometry and flow cytometry.

A compound identified by the method of claim 28 as an inhibitor or
an enhancer of a signal transduction pathway which promotes 
phagocytosis of apoptotic cells. 

A method for determining whether a compound is an inhibitor or an
enhancer of a signal transduction pathway which promotes 
phagocytosis of apoptotic cells which method comprises exposing a 
mammalian cell selected from the group consisting of: 

(1) a transgenic mammalian cell transfected with a nucleic acid 
comprising a nucleic acid sequence shown in SEQ ID
No. 1, 3, 5, 7, 9, 11, 13, or 15,

(2) a mammalian cell which expresses a protein selected from the 
group consisting of: 

a) a protein comprising an amino acid sequence as shown 
in SEQ ID No. 2, 4, 6, 8, 10, 12, 14, or 16, 

b) a protein comprising an amino acid sequence which is 
at least 40% identical with SEQ ID Nos. 2, 4, 6, 8, 10, 
12, 14, or 16; and 

c) a protein comprising an amino acid sequence encoded 
by the sequence of nucleotides shown in SEQ ID Nos. 
1, 3, 5, 7, 9, 11, 13, or 15, and

(3) a mammalian cell which comprises a vector expressing RNA 
antisense to a protein selected from groups consisting of: 

a) a protein comprising an amino acid sequence as shown 
in SEQ ID No. 2, 4, 6, 8, 10, 12, or 16, 

b) a protein comprising an amino acid sequence which is 
at least 40% identical with SEQ ID Nos. 2, 4, 6, 8, 10, 
12, 14, or 16; and 

c) a protein comprising an amino acid sequence encoded
by the sequence of nucleotides shown in SEQ ID Nos.
1, 3, 5, 7, 9, 11, 13, or 15, and

to a compound to be tested and determining whether there is any 
change in the organization of the actin cytoskeleton, wherein an 
increase in the rearrangement of actin cytoskeleton indicates the
enhancer, and a decrease in the rearrangement of actin cytoskeleton
indicates the inhibitor. 

A method as claimed in claim 38, wherein the actin cytoskeleton is 
visualized with a fluorescent dye which is linked to a compound 
which interacts with F-actin.

A method as claimed in claim 39, wherein said linker compound is 
phalloidin.

A method as claimed in claim 38, wherein the transgenic mammalian 
cell is a fibroblast cell or an epithelial cell. 

A method as claimed in claim 41, wherein the transgenic mammalian 
cell is selected from the group consisting of: COS1, BHK21, L929,
CV1, SWISS 3T3, HT144, IMR32, HEPG2, MDCK, MCF7, U293,
Hela, A549, SW48, and G361.

A method as claimed in claim 38, wherein said transgenic mammalian 
cell is a primary cell. 

44. A method as claimed in claim 43, wherein said transgenic mammalian
cell is selected from the group consisting of: human dermal FIBs, 
dermal keratinocytes, leucocytes, monocytes, and macrophages. 

45. A compound identified by the method of claim 38, as an inhibitor or
an enhancer of a signal transduction pathway which promotes
phagocytosis of apoptotic cells.

46. A method for determining whether a compound is an inhibitor or an
enhancer of expression of a gene encoding a protein which
participates in a signal transduction pathway which promotes
phagocytosis of apoptotic cells which method comprises:

(1) exposing a transgenic mammalian cell as claimed in claim 7 to 
said compound, 

(2) measuring the level of expression of said reporter gene, and 

(3) comparing said expression with the level of expression of said
reporter gene in the absence of said compound, wherein an
increased rate of phagocytosis indicates an enhancer and a
decreased rate of phagocytosis indicates an inhibitor.



47. A compound identified by the method of claim 45, as an inhibitor or 
an enhancer of expression of a gene encoding a protein which 
participates in a signal transduction pathway which promotes 
phagocytosis of apoptotic cells. 



48. An antibody directed against an epitope of the protein selected from 
the group consisting of: 

a) a protein comprising the amino acid sequence as shown in
SEQ ID No. 2,

b) a protein comprising the amino acid sequence as shown in
SEQ ID No. 4, 

c) a protein comprising the amino acid sequence as shown in 
SEQ ID No. 6, 

d) a protein comprising the amino acid sequence as shown in 
SEQ ID No. 8, 

e) a protein comprising the amino acid sequence as shown in 
SEQ ID No. 10, 

f) a protein comprising the amino acid sequence as shown in 
SEQ ID No. 12, 

g) a protein comprising the amino acid sequence as shown in 
SEQ ID No. 14, 

h) a protein comprising the amino acid sequence as shown in 
SEQ ID No. 16, 

i) a protein comprising an amino acid sequence which is at least 
40% identical to the amino acid sequence of SEQ ID No. 2, 4, 
6, 8, 10, 12, 14, or 16 and 

j) a protein comprising an amino acid sequence encoded by the 
nucleic acid sequence as shown in SEQ ID No: 1, 3, 5, 7, 9, 
11, 13 or 15. 

An antibody as claimed in claim 48, which is a monoclonal antibody. 

A method of treating an individual having a disease selected from
the group consisting of: inflammation, autoimmune disease and 
cancer comprising administering to a patient a medicament 
comprising an effective amount of a protein selected from the group 
consisting of: 

a) a protein comprising the amino acid sequence of SEQ ID No.
8,

b) a protein comprising an amino acid sequence which is at least 
40% identical to the amino acid sequence of SEQ ID No. 8 
and 

c) a protein comprising an amino acid sequence encoded by the 
nucleotide sequence of SEQ ID No. 7.



51. A method of treating an individual having a disease selected from the
group consisting of: inflammation, autoimmune disease, and cancer
comprising administering to a patient an effective amount of a
compound which is an enhancer of a signal transduction pathway
which promotes phagocytosis of apoptotic cells.



52. A method of treating an individual having a disease selected from the
group consisting of: inflammation, autoimmune disease and cancer
comprising administering to a patient an effective amount of a
compound which is an enhancer of expression of a gene encoding a
protein which participates in a signal transduction pathway which
promotes phagocytosis of apoptotic cells.



53. A method of treating an individual having a disease selected from the
group consisting of: inflammation, autoimmune disease and cancer
comprising administering to a patient an effective amount of a nucleic
acid selected from the group consisting of: SEQ ID No. 7, 13, and 15.



54. A method of treating an individual having a disease selected from the
group consisting of: neurodegenerative disease, stroke, and sickle-cell
anaemia comprising administering to a patient an effective amount of
a protein selected from the group consisting of:

a) a protein comprising the sequence of amino acids of SEQ ID
No.: 8, 14, or 16,

b) a protein comprising the sequence of amino acids which is at
least 40% identical to the amino acid sequence of SEQ ID No.:
8, 14, or 16, and

c) a protein comprising an amino acid sequence encoded by the
nucleotide sequence of SEQ ID No.: 7, 13, or 15.

A method of treating an individual having a disease selected from the 
group consisting of: neurodegenerative disease, stroke and sickle cell 
anaemia comprising administering to a patient an effective amount of 
a compound identified as an inhibitor of a signal transduction pathway 
which promotes phagocytosis of apoptotic cells.

A method of treating an individual having a disease selected from a 
group consisting of: neurodegenerative disease, stroke and sickle cell 
anaemia comprising administering to a patient an effective amount of 
a compound identified as an inhibitor of expression of a gene 
encoding a protein which participates in a signal transduction pathway 
which promotes phagocytosis of apoptotic cells. 

A method of treating an individual having a disease selected from the 
group consisting of: neurodegenerative disease, stroke, and sickle-cell 
anaemia comprising administering to a patient an effective amount of 
a nucleic acid selected from the group consisting of:

a) a nucleic acid comprising the sequence of nucleotides of SEQ 
ID No. 7, 13, or 15, 

b) a nucleic acid which hybridizes to the sequence of nucleotides
of SEQ ID No. 7, 13, or 15, and

c) a nucleic acid that encodes SEQ ID NO.: 8, 14, or 16.

A pharmaceutical composition comprising a protein selected from the
group consisting of: 

a) a protein comprising the amino acid sequence of SEQ ID No.
8, 14, or 16, 

b) a protein comprising an amino acid sequence which is at least 
40% identical to the amino acid sequence of SEQ ID No. 8, 
14, or 16, and 

c) a protein comprising an amino acid sequence encoded by the 
nucleotide sequence of SEQ ID No. 7, 13, or 15, 

and a pharmaceutically acceptable carrier. 

A pharmaceutical composition comprising a compound as claimed in
claim 27 and a pharmaceutically acceptable carrier. 

A pharmaceutical composition comprising a compound as claimed in 
claim 37 and a pharmaceutically acceptable carrier. 

A pharmaceutical composition comprising a compound as claimed in 
claim 45 and a pharmaceutically acceptable carrier. 

A pharmaceutical composition comprising a compound as claimed in 
claim 47 and a pharmaceutically acceptable carrier. 

A pharmaceutical composition comprising a nucleic acid selected
from the group consisting of: 

a) a nucleic acid comprising the sequence of nucleotides of SEQ 
ID No. 7, 13, or 15, 

b) a nucleic acid which hybridizes to the sequence of nucleotides 
of SEQ ID No. 7, 13, or 15, and 

c) a nucleic acid that encodes SEQ ID NO.: 8, 14, or 16. 

A method for identifying proteins which interact with the proteins of
claim 1, in a signal transduction pathway which promotes engulfment
of apoptotic cells comprising the steps of:

(a) providing a host cell having a DNA construct comprising a 
reporter gene under the control of a promoter regulated by a 
transcription factor, which factor has a DNA binding domain 
and an activating domain, 

(b) expressing in said host cell a first hybrid DNA sequence 
encoding a first fusion of a fragment or all of a nucleic acid of 
claim 2 and either said DNA binding domain or said activating 
domain of said transcription factor, 

(c) expressing in said host cell at least one second hybrid DNA 
sequence encoding a putative interacting protein together with
the DNA binding domain or activating domain of the
transcription factor which is not incorporated in the first
fusion, and

(d) determining any binding of the protein being investigated with 
a protein according to any of claims 1, 10 or 16 by detecting 
any production of the reporter gene product in said host cell. 

An isolated protein from the nematode worm C. elegans which
comprises an amino acid sequence of from about amino acid residue
242 to about amino acid residue 338 in Figure 2A or an amino acid
sequence which differs from that of said protein only in conservative 
amino acid changes. 

An isolated protein which comprises an amino acid sequence which is 
from about amino acid 11 to about amino acid 190 in Figure 20 or an
amino acid sequence which differs from that of said protein only in 
conservative amino acid changes. 

A method of diagnosis of a disorder in a patient which is associated 
with a defect of phagocytosis of apoptotic cells which comprises 
exposing a nucleic acid selected from the group consisting of: 

a) a nucleic acid comprising the sequence of nucleotides shown 
in SEQ ID No. 1, 3, 5, 7, 9, 11, 13, or 15,

b) a nucleic acid comprising the complement of the sequence of 
nucleotides shown in SEQ ID Nos. 1, 3, 5, 7, 9, 11, 13, or 15,

c) a nucleic acid capable of hybridizing to a nucleic acid 
according to (b) under conditions of low stringency 

d) a nucleic acid which encodes an amino acid sequence which is 
at least 40% identical to the amino acid sequences of SEQ ID 
Nos. 2, 4, 6, 8, 10, 12, 14, or 16, 

e) a nucleic acid which encodes an amino acid sequence of SEQ 
ID Nos. 2, 4, 6, 8, 10, 12, 14, or 16,

to a nucleic acid sample from the patient and detecting hybridization.

A method of diagnosis of a disorder in a patient which is associated
with a defect of phagocytosis of apoptotic cells which comprises 
detecting a protein selected from the group consisting of: 

a) a protein comprising the amino acid sequence of SEQ ID
No. 8, 

b) a protein comprising the amino acid sequence of SEQ ID No. 
14, 

c) a protein comprising the amino acid sequence of SEQ ID No. 
16, 

d) a protein sequence having an amino acid sequence which is at 
least 40% identical to the amino acid sequence of SEQ ID 
Nos. 8, 14 or 16 or a protein sequence encoded by the nucleic 
acid sequence of SEQ ID Nos. 7, 13, or 15. 

in a sample from a patient with an antibody to an epitope of one of the 
aforesaid proteins. 

A protein which comprises a protein selected from the group 
consisting of: 

a) a protein comprising the amino acid sequence as shown in 
SEQ ID No. 2, 4, 6, 8, 10, 12, 14, or 16, 

b) a protein comprising an amino acid sequence which is at least 
40% identical to the amino acid sequence of SEQ ID No. 2, 4, 
6, 8, 10, 12, 14, or 16,

c) a protein comprising an amino acid sequence encoded by the 
nucleic acid sequence as shown in SEQ ID Nos. 1, 3, 5, 7, 9,
11, 13, or 15, 

wherein said protein is fused to another protein.

70. A protein as claimed in claim 69, wherein said other protein is an
epitope tag or the product of a reporter gene. 

71. A method for identifying whether a compound is an enhancer of a
signal transduction pathway which promotes phagocytosis of
apoptotic cells which comprises the steps of:

(1) exposing a C. elegans, in which the expression of CED-6 is
defective or otherwise suppressed, to a compound to be tested 
and 

(2) scoring for return to wild-type phenotype. 

72. A method for determining whether a compound is an enhancer or
inhibitor of a signal transduction pathway which promotes
phagocytosis of apoptotic cells which comprises the steps of:

(1) exposing a transgenic C. elegans as claimed in claim 17 to the
compound to be tested, and

(2) measuring the level of phagocytic activity by scoring apoptotic
corpses in the heads of L1 larvae and/or the gonads.

73. An isolated protein which is an adaptor molecule in a signal
transduction pathway which regulates phagocytosis of apoptotic cells.



74. Use of a protein or nucleic acid for use in therapy, e.g., inflammation,
autoimmune disease or cancer, comprising: a protein comprising an
amino acid sequence of SEQ ID NO.: 2, 4, 6, 8, 10, 12, 14, or 16, a
nucleic acid that encodes SEQ ID NO.: 2, 4, 6, 8, 10, 12, 14, or 16, an
amino acid sequence encoded by SEQ ID NO.: 1, 3, 5, 7, 9, 11, 13, or
15, or a nucleic acid sequence of SEQ ID NO.: 1, 3, 5, 7, 9, 11, 13, or
15.

Use of a protein or nucleic acid for use in therapy, e.g.,
neurodegenerative disease, stroke or sickle cell anemia, comprising: a
protein comprising an amino acid sequence of SEQ ID NO.: 2, 4, 6, 8,
10, 12, 14, or 16, a nucleic acid that encodes SEQ ID NO.: 2, 4, 6, 8,
10, 12, 14, or 16, an amino acid sequence encoded by SEQ ID NO.: 1, 
3, 5, 7, 9, 11, 13, or 15, or a nucleic acid sequence of SEQ ID NO.: 1, 
3, 5, 7, 9, 11, 13, or 15. 

Overexpression of ced-6 Rescues
the Engulfment Defect of ced-6 Mutant

Overexpression of ced-6 Rescues the Engulfment Defect of
ced-6 Mutant During Embryonic Development

Can ced-6 also promote the engulfment
of Persisting corpses?

Overexpression of ced-6 Rescues the Engulfment Defect
in the Adult Gonad, and ced-6 Might Act in Somatic Sheath Cells 

GGTGATGAGCCCTTGGGTTCTCGCTCCGACTGCTAAATTCGCTTGGCCGGGTCCACC^
CGTaGCCTCACTCGCCACACGGATCAGAATCCGGAGCAGGCAGTTCTCTCTATTCTGAGGC 
TCCTGCGGCTGCCGCGCTGACTTCCCTGTGTGGNGGAGGG.^^^CTCrGGGCAGGCTGGTTTI 
CTTGGAATGTGTTTACGATGTTGAATGGGACTTGAACAGGAAGCTGGACGCTGCAGCTGG 

aactagcgtgccaagttatttatgattccatctgatatacataggagagaaactgataga 

agaattctgatggcaactgtatgatagaagctatataa.a.gtcaagtgtccattttctttca 

actatatttgagcatacccaggatttaagtcgtggaactga-^camatttggctgatcct 

catcatgaaccgtgcttttagcaggaagaaagacaaaacatggatgcatacacctgaagc 

tttatcaaaacatttcattccctataatgcaaagi 1 ic'l' i' ggcagtacagaagtggaacag 

ccaaaaggaacaga.\gttgtgagagatgctgt.aaggaaact.aaagtttgcaagacatat 

caagaaatctgaaggccagaaaattcctaaagtggagttgcaaatatcaatttatggagt 

aaaaattctaga-\ccca,a.a.acaaagg aagttca.a.caca.\ttgccagcttcatagaatatc 

ll-i'l"l-gtgcagatgata.a.v\ctgacaagaggatattcactttcatatgcaa.\gattctgag 

tca.au^taaacatttgtgctatgtatttgacagggaaaagtgtg ctgaagagatcact^ 

caattggccaagcarrrgacctggcatacacga.\atttctaga^tcaggaggaa.^agatg 

ttgaaacaagaaaacagatcgcagggttacaaaaaaga.^tccaagacttagaaacagaa 

aatatggaacttaa,\.\.^taaagtacaagatttggaaaacca,^ctgagaataactcaagta 

tcagcacctccagcaggcagtatgacacctaagtcgccctccactgacatctttgatatga 

ttccattttctccaatatcacaccagtcttcgatgcctactcgc.aatggcacacagccacc 

tccagtacctagtacatctacrgagattaaacgggacctgtttggagcagaaccttttgac 

CCATTTAACTGTGGAGCAGCAGATTTCCCTCCAGATATTC.A^^TCAA,'\ATTAGATGAGATGC 

AGGAGGGGTTCA,\.AATGGGACTAACTCTTGAAGGCACAGTATTrTGTCTCGACCCGTTAG 

ACAGTAGGTGC TGA CATCAAGAACAAGAAATCCTGATTCATGTTA.A^TGTGTTTGTATAC 

ACATGTCATTTArTATTATTACTTTAAGATAGGTATTATTCATGTGTCA.\TGTTTTT 

TTTTAATATTTTGAA.A\TTTTCTCAGTTAAATTTCCT^ 

TATmAAAAACAGCTTACTGTAAAGTAGATCATACnTLTATGTTCCmCTGTTTCTACTG 
TAGATGAATTTGTA.^TTG.A-AAGACATATTATACA.\ATACCTGCCTTGTGTCTGAGTTCTAT 
TTAGTTAGCATCTTG.VXATTTGTATTCATTTTCCAGATGGCTAGTTTATTAATC 
A.AAGCCATACCTTAAAGATAACTTTTTAAATTCTGAAGAGACATGCCAATGTCA.JuACTAA 

acatgttctgl'ni"i\a\accaacaaacatgttactarrcattggacagatatcattttatgt 

ataaatactgttcacatcactgggaaaatgtaaactttaaacataatgccacaaggtcac 

taatttctagcaggtaaaattataaggatataaattccaataataa.^ccaaatgtatttag 

agtamattactaaatgcaaggtgatgttagttatgatcagttatactctaaatattta-^ 

tttgttttataaaggtagtgaaaaaatgaa.aatttgctatttattaa.aaaa 

cattccaaatgagata.agtgatattactataacatctaagcatcatctgatttgatattcc 

CTA.AAAAACATTTGGAATATATGCTATCTATAGATTCAGTATCTACTACCCATATTTACTTT 

ACCAAATATATTTCTCCTCACTGCATAAGGACTACTCTTCTCATATTTTCTTCTTTGATGAA 

GATATTTrrCACCAAAGTTTATTTTGTGATGCCCTCTTGGTTTTGATACTTTAAA-ATCTGTG 

GCACCCGTTCTACATGAATTATCAATATTrGGTAAATTCAATCTGTATTTGTTTTCTTAAAG 

TCAAAAATCTCATTTTCCAAAAA.A*AAA*AAAAAAAAACTCGAG 

GGTGATGAGCCCTTGGGTTCTCGCrCCGACTGCTAAATTCGCTTGGCCGGGTCCACCTTCT

CGTGGCCTCACTCGCCACACGGATCAGA-^TCCGGAGCAGGCAGTTCTCTCTATTCTGAGGC 

TCCTGCGGCTGCCGCGCTGACTTCCCTGTGTGGNGGAGGGAACTCTGGGCAGGCTGGTTTT 

CTrGGAATGTGTTTACGATGTTGAATGGGACTTGAACAGGAAGCTGGACGCTGCAGCTGG 

A-'VCTAGCGTGCC^AAGTrATTTATGATTCCATCTGATATACATAGGAGAGAAACTGATAGA 

AGAATTCTGATGGCAACTGTATGATAGAAGCTATATAAAGTCAAGTGTCCATrrrCTTTCA 

ACTATATTTGAGCATACCCAGGATTTAAGTCGTGGAACTGAACATTTATTTGGCTGATCCT 

CATCATGAACCGTGCTTTTAGCAGGAAGAAAGACAAAACATGGATGCATACACCTGAAGC 

TTrATGAAA.\CATTTCATTCCCTATAATGCAAAGTTT^ 

CCAAAAGGAACAG.-\.\GTrGTGAGAGATGCrGTAAGGAAACTAAAGTTTGCAAGACATAT 

CAAGAAATCTGAAGGCCAGAAAATTCCrAAAGTGGAGTrGCAAATATCAATTTATGGAGT 

A-'KAAATTCTAGA.^CCCAA.AACAAAGGCrGAAGAGATCACTTTAACAATTGGCCAAGCAT^ 

TGACCTGGCATACACGA.AATTTCTAGAATCAGGAGGAAAAGATGTTGAAACAAGAAAAC 

AGATCGCAGGGTTACAA-A-AAAGAATCCAAGACTTAGAAACAGAAAATATGGAACTTAAA 

A.ATAAAGTACA.AGATTTGG.A.AAACCAACTGAGA.\TAACTCAAGTATCAGCACCTCCAGCA 

GGCAGTATGACACCT.AAGTCGCCCTCCACTGACATCTTTGATATGATTGCATTTTCTCCAA 

TATCACACCAGTCTTCGATGCCTACTCGCAATGGCACACAGCCACCTCCAGTACCTAGTAG 

ATCTACTGAGATT.\.A.\CGGGACCTGTTTGGAGCAGAACCTTTrGACCCATTTAACTGTGGA 

gcagcagatttccctccagatattcaatcaaaattagatgagatgcaggaggggttcaaa 
atgggacta,actcttga.aggcacagtattttgtctcgacccgttagacagtaggtgctga 

CATCAAGA-ACAAG.A.\.\TGCTGATTCATGTTAAATGTGTTrGTATACACATGTCATTTATTA 

itattactrr.a.agataggtattatrcatgtgtcaatgtttttgaatatm 

aattttctcagttaa\ttrcctcaccttcactattgatctgtaat'rr^ 

cttactgtaaagtagatcatacttttatgttcctttctgtttctactgtagatg^^ 

attgaaagacatattatacaaatacctgccttgtgtcrgagttctatttagttagcatctt 

ga.\atttgtatrcattitccagatggctagtttattaatgamcccaa-aagcc 

aagataacl iti-iaaattctgaagagacatgccaatgtcaaactaaacatgttctgi'n 1 1 

aa.^ccaac.^^acatgttacrattcattggacagatatcatmatgtataaatactgttca 

catcactggg,a.\aatgtaaacttraaacataatgccacaaggtcactaatttctagcagg 

taaaattata-aggatataaattccaata.^taaaccaaatgtatttagagtatttattagta 

AATGCAAGGTGATGTTAGTTATGATCAGTTATACTCrAAATATTTAATTrGTT^ 

GTAGTGAAA.AAATG.A.\-A.A.TTTGCTATTTATTAAAAAACATTAAAmCAT^^ 

AT.^AGTGATATTACTATAACATCTA.AGGATCATCTGATTTGATATTCCCTA.AA.\AACAm 

GG.\ATATATGCTATCTATAGATrCAGTATCTACTACCCATAmACmAGCAAATATATTr 

CTCCTCACTGGATAAGGACTACTCTTCTCATAl'i'l'rcri'a'I-i'GATGAAGATA'ri'ri TCACC 

AA.\GTTTATTrrGTGATGGCCTCTTGGTTITGATACTITAAAATCTGTGGCACCCG^ 

ATGAATTATCAATATTTGGTAAATTCAATCTGTATTTGTTTTGTTAAAGTCAAAA-ATCTCAT 

TTTCCAAAAAAAA.\.\AAAAAAAACTCGAG 
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^f^•RAFSRKKDKT^V'^^HT?EALSKHFIPYNAKJ^LGSTEVEQPKGTEVVllDAVRKLK^^^ 

EGQKJPKVELQISIYGVKILEPKTK EVQHHCOLHRISFCADDfCTDKRIFTFICKDSESN^LCYV 

FOSEKCA£ErTLTIGQAfDL\VTKFLESGGK:DVETILKQ(AGLQFC^ 

^'OLmoySAPPAGS^ETP.^SPSTDlFD^CIPFS?[SHQSS^^PTR^•GTQPPPVPSRSTE^KRDLFGAE^ 
pDPF^rG,A.<KDF?PDrQSKI.D£\fQEGFX\{GLTLEGTVTCLDPLDSKC* 

FIG. 22

Human ced-6 cDNA and protein



GGTGAOGAQC CCTTOGGTTC TCGCTCCGAC TGCTAAATTC GCTTGGCCGG GTCCACCTTC TCGTGGCCTC ACTCGCCACR. CGOATCAGAA rtCCGQMCAG 100 



QCAGTTCrCT CTATCCTGAG GCTCCTCCGQ CTGCCGGCTO ACWCCCTGT GTGCGG0W3G GAACTCTGCG aWMCTGGTr TTCTTGGAAT 5Tt3TTTACGX 300 



TGTTGAATGG GACTTGAACA GGAAGCTGGA CGCrCCAGCT OGAACTAGCG TQCCAAGTTR TTOMJOATTC CATCKJATAT ACATAGGAGA GAAACTOATA 300 



GAAGAATTCT GATGGCAACT GTATGATAGA AGCTATTATA AAGTGAAfiTG TCCArWTCT TlCTlAdXTA TTTSAGCATA CCCAGCSATTT AACTCGTOGA <0a 

ACTGAACATT TATTTGGCTC ATCCTCATGA TGAACCGTGC TTTTAGCAGG AAGAAAfiACA JWACATGGAT GCATACACCT GAAGCTTTAT CAAAACATTT 500 

M NRA r3R KKDK TWK HTP E ALS KHP 

CRUTCaXAT AATOCAAAer TPCTTSGCAG TACAiSAAGTG GAACAGCCAA AACGAACAGA AGTTSTaAGA GATGCTGTAA GGAAAGTAAA OTTTOCAAaA SQQ 
I?Y WAKF LG3 fg^T E QPK GTE VVR Dft.VR XLK FAR 

PTB domain 

CATAICAAGA AATCTGAAra CCAGAAAAirr CCTAAAGTCG AGITGCAAAT ATCAATTTAT GGAGTAAXAA rTCTAGAACC CAAAACAAAG GAAGTCCAAC 700 
KIICK SBG OKI PKV -S LOI STY GVK I L5P KTTC gVQH 

ACAATTGCCA GCTTCATACA ATATCTTTTT GTGCAGATQA TAAAACTGAC AAOAGGATAT TCACTTTCAT ATTJcaUAjQAT TCTGAGTCAA ATAAACAaTT BOO 
^ r. O T.tfP TffPr ABD TCTD Kfl lF TFT PKn RF. BN KTtr. 

GTCCTATOTA TTTGACAGCG AAAASTGTGC TGAAGAQATC ACTTTAACAA TTGGCCAAGC ATTTGACCTG GCATRCACQA AATTTCTAGA ATCAGGACGA 900 

C? Y V 17 n B TC ^ B F T T T. T r Q 0 A F ^ ^ A Y T TC F r. R S Q G 

AAAOATGTTG AAACAAGAAA ACAGATCGCA aGGTTACAAA AAACAATCCA AGACTTAGAA ACAGAAAATA TGGAACTTAA AAATAAAGTA CAAGATTTGG 1000 
K D V £. ^„R„K_a.L A. -^-J'-i2-K_ R. i. -i>^_E_ L B. _E_I._K„ IL V. 

Charged region

AAAACCAACP SAGAATAACP CAAGTATCAG CACCTCCAGC ACGCAGTATG ACACCTAAGT COCCCTCCAC TQACATCTTT GATATCATTC CATmCTCC 
_?__Q_^I*^ I T Q V S A PPA GSM *gPX3 P ST. DIP PMIP F5P 

AATATCACAC CAGTCTTCGA TGCCTACTCG CAA3SGCACA CAGCCACCTC CAGTACCTAG TAOATCTACT QAGATTAAAC GGOACCrGTS TGGAGCACAA 120fl 
ISH 0S5H PTy WGT QPPP VP_S n5T SIKR Dt. F GA -E. . . 

proline/serine rich region (potential SH3 binding domain)

CCTTTTGACC CATTXAACTG TGGAGCAGCA GATTTCCCTC CAGATATTCA ATCAAAATTA GATGAQATGC AGGAGOGGTT CAAAAIIGGGA CTAACTCTTQ 1300 
P.FDP FWC G A.A DFPP BIO 5KL OSMO'E GF KKG LTLE 

AAGGCACACT ATXrCGTCX GACCCGTTAG ACAGCACGTG CTGACATCAA GAACAAGAAA OXICTGATTCA TCTTAAAXGT GTTTGTATAjC ACATGICAirr 1400 

GTV FCL DPLD SRC 
TATTATTATr ACTOAAGAT ACCTA3TATT CATCrGTCAA TOTrTTTGAA TATTTIAATA TTTTGAAAAT TTTCTCAGTr AAATTTCCTC ACCTTCACTA IS 00 

TTGATCTCTA ATTXTTATTT TAAAAACACC TTACTGTAAA GTAGATCATA CTTTTATCTT CCTTTCTGTT TCTACIGTAG ATOAATTTGT* AATTOAAAGA 1600 

CATATCAiac AAATACCTGC GTTGTGTCTG AGTTCTATTT AGTTAQCATC MOAAATTTG tATTCATTTT CCAGAIGGCT AGTTTATTAA TGATTTCCCA ^70° 

AAAQCCATAC CTTAAAGATA ACTTTTTAAA TTCTGAAGAG ACATGCCAAT GTCAAACTAA ACATGTTCTC TTOTAAACC AACAAACATQ .TTAt^TATTCA 1800 

TTGCWWaWAT ATCAITTTAT GTATAAAffAC TQTTCACATC ACXGGGAAAA TGTAAACTlT AAACATAATG CCACAACGTC ACTAATTTCT AGCAGGTAAA 1900 

ATtATAAGGA TATAAATTCC AATAATAAAC CAAATOTATT TAGAGTATTT ATTAGTAAAT GCAAGGTGAT GmCTTATC ATCAGTTATA CTCTAAATAT ^OOO 

TTARTTIG'Pr TTATAAAGGT AGTGAAAAAA TGAAAATTTG CTATTTATTA AAAAACATTA AATTTCATTC CAAATGAQAT AAGTGATArP ACTAa-AACAT 2100 

CTAACCATCA TCTGATTTOA TATTCCCTAA AAAACATITTG GAATATATGC TATCTATAGA TTCAGTATCT ACTACCCATA TTTACTTTAC CAAATATATT 3200 

TCTCCOCACr GCAaSRAflSAC TACTCTICTC A!CATTTTCT2' CtTlGATGAA GATATTTrPC ACCAAACTTT A'm' T G T GAT GCCCTCTTGO TTOGATACT 2300 

TTAAAATCTG TGGCACCCGT TCTACATQAA TTATCAAtAT IWSTAAATT CAATCTGTAT TTCTTTTCrrr AAAGTCAAAA ATCTCATTTT CCAAAAAAAA 2400 

AAAAAAAAAA AC 

SIM output with parameters: substitution scores in BLOSUM62, O = 12, E = 4

Sequence 1: CED-6, 492 residues

Sequence 2: hCED6, 304 residues

List of local alignments with score >= 35.0

47,5% identity in 134 residues overlap; Score: 3as,0; Gap frequency: 2.7% 

CED-fi, 4S RTWIHPPDYLINGHVEYVARFLGCVETPKjVNGSDVAHEAIHAIRFQRDLKRSEQT^ 

hCEDfi. 11 KTW>ffITPEALSKHFIPYNAKFLGSTEVSQPKGTBVVRDAVBKLKFARHI5aCSE GQK 

* » « • * * * *«« * *«** w*-* * 

laS LQKVEIRISIDNVIIAIDIKTKA'PMYTFPLGRISFCW3DKDDKJl«FSFIARAEGA5GXPSC 
hCEDS, S7 IPKVEZ«QrSIYGVKILEPKTKSVQKNCaLKSXSFCADDICTDKRIFTriCK:-DSESNKHLC 

CED-fi , ISS YAFTSEKrAEDITLTIGEAFDIAYKRFLDKKRTSLENQKQlYILiaCKIVELETENQVLIS 

hCEDfi , 12fi YVFDSEKCAEEITLTIGQAnaiAYTKFLESGGKDVETilXQrAGLQKKIQDLETENMELKN 

* * «« -fr *** * « «- » 

CED-^. 225 RIAE 

hCEDG, laff KVQD 



31.5% identity in 38 residues overlap; Score: 38.0; Gap frequency: 0.0% 



CED-6, 265 PNrPPSSIYSMPRANDLJPTEMAPTLPQISTSSNGASP
hCED6, 221 PFSPISHQSSMPTRWGTQPPPVPSRSTEIKRDLFGAEP
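
The summary line printed with each local alignment (percent identity, residues of overlap, gap frequency) can be recomputed from the two aligned rows. The following is a minimal sketch only: it assumes that both percentages are taken over the number of alignment columns and that gaps are written as "-". The function name `alignment_stats` and the toy sequences are illustrative and do not come from the SIM program or from the figure.

```python
def alignment_stats(row_a: str, row_b: str, gap: str = "-") -> dict:
    """Summarize a pairwise alignment given as two equal-length gapped rows.

    Assumption: identity and gap frequency are both expressed as a
    percentage of the total number of alignment columns.
    """
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have the same length")
    if not row_a:
        raise ValueError("alignment is empty")

    columns = len(row_a)
    # A column is identical when both rows carry the same residue (not a gap).
    identical = sum(1 for x, y in zip(row_a, row_b) if x == y and x != gap)
    # A column counts as gapped when either row carries the gap character.
    gapped = sum(1 for x, y in zip(row_a, row_b) if x == gap or y == gap)

    return {
        "overlap": columns,
        "identity_pct": round(100.0 * identical / columns, 1),
        "gap_pct": round(100.0 * gapped / columns, 1),
    }


if __name__ == "__main__":
    # Toy rows, not taken from the figure.
    print(alignment_stats("ACDE-FG", "ACDQWFG"))
```

Other programs may divide by the shorter sequence length or by the number of ungapped columns, so figures computed this way need not match a given tool exactly.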
FIG. 25(A)



FIG. 25(B) 
Figure 28A
hced-6 alignment Formatted Alignment 

R659B3ygenbanlc. — ^- 

//\iXMd'€ CONA. /d'<SGTGMXaGC CCaTGGGnTC TCGCTCCGAC TGCTAAATTC Ol ' VlU GCCGG 50 

Tha PCR firagmeat 

(lbc3123 EST dona 

y5'/R6Sae2/5enbnnk r ^GCEAAATTC GCTTGGCCGG 19 

^5*/m59394/genbank ^tKHGATOAGC OCTIGGGTIC TOSCTCCmc SO 

Consenfluff ' GGTGATGM3C CCrTGOGTIC TCGCTCC(5AC TGCTAAATIC GepIGGCCBG SO 



B033UJ/ge«oaxini- 

//hcad-6 CDMA GTCCACCTTC TajTGGCCIG ACTCGCCACA CGGATCAOAA. 1CCGC5AGCAG 100 
Tha ?CR fragmant ^- ' -■ 



iyS'/R65882/genbank GTCCACCCTC TCGTOGCCTC AdCGCCACA. CQC3ATCAGAA TCKGOAQCAG S9 

//SVAAl59394/genhaxilc (SICCACCTTC TCGTGOCCTC ACTCGCCACA COCSATCAGM. TCCGGAGCXG ' 100 

Consensus CTICCACCnC ICfflGGCCTC ACTCGCCACA CGGATCAGAA TCCGGAGCAG 100 

/ThMd-fi^'craSA^* GCAGTOTCr CTATTCTCSAG GCTCCrGCGG C-TGCCGQC- TGACTTCCC- 14*7 

The PCR fragment * 

V/R6SQ92/genbaiik * GCAC?rTCTCT CTATTCTSAG G CTCCTg CQG C-TGCCGGC- TOACITCCC- llfi 

// S'/Aai53394/genbawk GCAGTTCTCT CXATTCTGAG GCTCCTCCGG CNTGCCM3CG TGAdTCCCG ISO 

Consensus GCACSTTCTCr CTRTTCTGAa GCTCCTGCGG CWXGCCNGCG TOACncCCG ISO 

RS5983/gaE3banJc. 

//hcad-6 CDNA TGTQTGCQGG AGGGAAGTCT GGGCAGGCTG GTTriCTTCG AATSTGTTEA 19t 

The PCR fragment * 

hbc3133 EST clone 

^5'/B6S3a2/gGnbank TOTGTGCGGG AGGGAACTCT GGQCaGOCTG G TmC TTGG liKlXA^^n^)^ 166 

//S'/AAlS9394/genbank TGTGIGGHGG AGGGRACTCl' GGGCAGGCTG GmTCTTCQ AATOTCTTm 200 

' .consensus TGTGTGSNGQ AGGGAACTCT GGGCAGGCTG <3rTTrCrrGQ AATCfrGTTTA 200 

;^ft659a3/genbanlc. ^^'trtGA AACGGCSH&AC CGGGCCMCTG CAAGCNGGAA. 3S 

//hGed-5 CCSJA CGATSTTGAA TGGGACTrG- AACAGGAAGC T-GGACGCIG G-AGCTQGAA 344 

Tha PCR. fragmsnt . ■ — «~— — - 

hl:c3ia3 SST clona 

^5Vn55802/'genbank CGATGTOAA TGGGACTKS- AACAGGAACC T-GGACGCTG C-AGCTGGAA 213 

//3'/AAlS9394/genbank CGATQTTOAA TGCqaClTO- AACAGGAACC T-GGACGCIG C-JtfS^^ 247 

ConsausuH CGAIGirGAA. TGGGACTIGA AACRGGMAftC YGGGMCNCTQ CAAGCNGCAA 2S0 

/;'Rfi5983/genbank. CTACCOTGCC CAAGraVrTT ATGW.]CCCCA CCTOATATAC ATGG3AGAGA. 85 

^ hced-6 cDWv CTAGCGTG-C CTVAGTEATTT ATGA-TTCCA TCTGATATAC ATAGGAGAGA 292 

The PCR fragaienc — — — .-u 

.^'S'/RSSaa^/genbank dAGCGTG-C CAAGTTATTT ATGA-TtCCA TCTQA TAlftC ATAflOAGAGA *2S1 

,'/5'/AAlS9394/genbank CTAGCGTG-G CAAiGTTAniT ATGA-TICC\ TCTGNTAXAC ATAGGAGAGA 295 

Consensus CTASCGTGCC CAAGTTATIT ATtSANXTfCCA YCTGNIATRC ATBGOAGAGA 300 

i^*Rfi59a3/g«abank. AAC-TGATAG AAGAA1TCTG ATGGCAACT3 TATGATAGAA GCTA-TATAA .133 

,^'hcod-6 cDNA AACT-GATRG AAGAATICTG ATGGCAACIQ TATOATAGAA GCTATTA'HAA 341 

PCR fragment tfJ^^AGAATTCTG ATGGCAACTO TATGATAGAA GCTATTATAA 40 

hbc3123 EST clone 

j^^5*/Rfi5BB2/genbRnk AfiCT-GATAS AAaAATrCTQ ATOGCAACTG TATGATAGAA GCTAT-ATAA 3QS 

^ S'/AA159394/ganbttnk AACTTGATAG AAGAATTCTG ATGGCAACTO TMtSATAGAA GCTA-TATAA 344 

Consensus AACTTCSATAG AAGAATTCTG ATGGCAACTO TATOA'TOQAA GCTATTATAA 350 

/^R6S993/genbank. AGTCAAGTOT CCArriTCrT TCARCTATAT TIGAGCATAG CCAGQATrTA 183 

//hC»d-6 CDNA AGTCAAGIGT CCATmCTT TCAACTATAT TTGAiGCATAC OCAGGATTEA 391 

//The PCR fragment AGTCAAGTGT CCATTTTCTT TCAACTATAT TTGACCATAC CCAGGATTIA 90 

hbc3123 ESV clone 

/*'5 VR65882/genbanlc AGTCAAGTGT CCATTITCIT TCAACTATAT ITGAGCATAC CCAOGGTOA 359 

*/S'/AA159394/genfaank AGICAAGTGT CCATTOTCTT TCAACTATAT TTGAGCRTAC CCflOQATITA 334 

Consensus AGTCAAGTOT CCA'ri'i'iUi'l' TCAACTATAT TTOAC3CATAC CCAfiGRTTTA 400 

>/jR6S903/genbanfc, AGTCQTC3SAA CTC3AACATTT ATTTCGCTGA TPCTCATCAT -GAACCGT3C 232 

V hC«d-5 cDNA AGTCQTGGAA CTGAACATTT ATTTGGCTGA 'IggTCA TCAT Q-ARCCGTC3C 440 

./The pen fragment ACTCGTSGAA CTGAACATTT ATTTGGCTGA TCCTCAT2AT G-AACCGT£3C 139 

hbc3123 EST done 

-^'S'/R6S8a2/genbttnJc . AXrECGTGGAA CTGAACATTT ATITSGCTGA TCCTCATCAT GGAACCGTGC 409 

,/5'/AAlS9394/genb«nk agtcqtGGAA CTGAACATT . AT 415 

consensus ACTCGTSGAA CTGAACATTT ATnGGCIGA TCCTCATCAT QGAACCGTQC 450 
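
The Consensus row of this alignment uses single-letter ambiguity codes (for example N, R, Y and W) at columns where the aligned sequences disagree. One simple way such a row can be derived is sketched below. This is an illustration under stated assumptions, not the rule used by the alignment software that produced the figure: every column is reduced to the set of bases observed in it, gaps and blanks are ignored, and the set is mapped to its IUPAC code. The function name `consensus` and the example rows are hypothetical.

```python
# IUPAC nucleotide ambiguity codes, keyed by the set of bases they stand for.
IUPAC = {
    frozenset("A"): "A", frozenset("C"): "C",
    frozenset("G"): "G", frozenset("T"): "T",
    frozenset("AG"): "R", frozenset("CT"): "Y",
    frozenset("CG"): "S", frozenset("AT"): "W",
    frozenset("GT"): "K", frozenset("AC"): "M",
    frozenset("CGT"): "B", frozenset("AGT"): "D",
    frozenset("ACT"): "H", frozenset("ACG"): "V",
    frozenset("ACGT"): "N",
}


def consensus(rows):
    """Return a consensus string for aligned nucleotide rows.

    Gaps ("-"), blanks and any other non-base characters are ignored;
    an "N" in a row is treated as "any base". Columns with no base at
    all are reported as "-".
    """
    width = max(len(row) for row in rows)
    out = []
    for i in range(width):
        bases = set()
        for row in rows:
            ch = row[i].upper() if i < len(row) else " "
            if ch in "ACGT":
                bases.add(ch)
            elif ch == "N":
                bases.update("ACGT")
        out.append(IUPAC[frozenset(bases)] if bases else "-")
    return "".join(out)


if __name__ == "__main__":
    # Hypothetical rows; the third one starts two columns later.
    print(consensus(["ACGT-A", "ACAT-A", "  CTNA"]))
```

A majority-rule consensus, in which the most frequent base wins and ambiguity codes are used only for ties, is an equally plausible alternative and would give different letters at some columns.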
Figure 28B
hced-6 alignment Formatted Alignment 

^aSS983/g«nbank* TTCTAGCAGQ AAGAAAGAiCA. AAACWGC3AT OCATACACCT GAAQCTriAT 282 

<tf'hCBd-e CDNA TOWACCAGG MGAAAOACA AAACA.TQGAT CSCMACACCT GAAGCTmT 490 

f Ttoa FCR Eragment , TTmCCAGG AAGAAAGAOV AAACATGSMP GCXIACACCT GAAGCXTIAT 139 

hbc3123 EST. clone — 

jyS'/R6S882/genbank TTTTAGCAGG AAGAAAGACA AAACATC3£3GT QClNhCACCT GAftG-WITAT 458 

S' /AAlS9394/genbanlc 7 415 

Consetiaug TrtTAGCAGG AAGAAAOACA AAACATGGRT GCWNACACCT GAAI3CNTTAT SQQ 

«i^R€S9Q3/9enbai^4 CAAAACATTT CATTCCCTAX AATGCAAACST nCTTGGCAG TACAGAAGTQ 332 

^•hced-S cim CAAAACATTT CATICCCTAT AATOCAAAGO^ -TTCTTGGCAG TACAGAAGTIG 540 

//Tha PCK tragmenC CAAAACATTT CATICCCTAT AATGCAAAGT TTCTTGGCAG tACAGAAGTC 239 

hbc3l23 EST clone — 

4^S'/R658B2/genbank CAAAACNTT- C-TTTCC— NAT — — ^ — TT— — - — — — -~- , 478 

5'/AAlS9394/3enbanic 415 

Consensus CAAAACNTTT CATTYCCTAT NATGCAAAflT TTCnGGCAG TACAGAAGTC 530 

^n65983/genbank. GAACAGCCAA AA6GAACAGA AGTIGIGAGA GATGCTGTAA GGAAACTAAA. 382 

hc«d-6 cim GAACAGCCAA AAGGAACAGA AGTrGTGAGA GATGCTGTAA. GGAAACTAAA 590 

// The pen fragmertt GAACAGCCAA AAGGAACAGA AQTTGTGAGA GATGCTGTAA GGAAACTAAA 289 

hfac3l23 EST Clone 

5 • /R65882/genbaiilc 478 

5'/AAlS9394/genbanic 415 

Conaonaus GAACAGCCAA AAGGAACAOA AGTTGTQAGA GATGCTGTiVA. GGAAACTAAA 600 

^R65983/genbank. GrTTOCAAGA CATNTCAAGA AATCTGAACG CCAAAAAA-- = 420 

irbced-6 CDMA GTTroCAAfiA CATATCAAGA AATCTGAAGG CCAGAAAATT CCTAAAGTGG fi40. 

//the pen fragment GTTTCCAAGA CATATCAAGA AATCTGAAGG CCAGAAAATT CCTAAAGIGG 339 

hbc3123 EST clone — ■ 

5VK658B2/ganbank r 478 

5VAAX59394/g«nbank ' 415 

Consensus OmGCAAGA CATNTCAAGA AATCTGAAGG CCARAAAAIT CCTAAAGTGG 550 

RfiS983/genbank. AA ^AAAAA ^AG 429 

j^hc«d-6 cCNA AGITGCAAAT ATCAATTTAT GGAGTAAAAA TTCTAGAACC CAAAACAAAG S90 

i/Trtie PGR fragment AGITGCAAAT ATCAATTTAT GGAGTAAAAA TTCTAGAACC CAAAACAAAG 309 

hbc3133 EST Clone 

5VR65aa2/genban]c • 478 

S'/AAlS9394/ganbank 41S 

consensus AGTTSC^AAT ATCAATTTAT GGAGTAAAAA TTCTAGAACC CAAAACAAAG 700 

R6S9a3/genbanic. ? — , 429 

^hced-6 ci3NA GAAGITCAAC ACAATOSCCA GCTTGATAGA ATATCmTT GTCCAGATQA 740 

/^Ttia PGR fragment GAAC7ITCAAC ACAATTGCCA GCITCATAGA ATATCrmr GTGCAGAT3A 439 

^hbc3123 EST clone /J'GAATTGCCA GCTTCATAGA ATATCTTTTr GTGCAGATGA 39 

S ♦ /RfiHa82/genbanlc 478 

5'/AAl59394/geni3anfc 415 

Canseasua GAi^C?rrCAAC ACAATXGCCA GCTXCATAGA ATATCmTT GTQCAGATGA 750 

R659B3/gonbank. 429 

^hcad-6 CDKA TAAAACIGAC AAGAGGATAT TCACTTICAT ATGCAAAGAT TCTSAOTCaVA 790 

Ttu PGR fragment: TAAAACTGAC AAGAGGATAT TCACTTTCAT ATGCAAAGAT TCTQAGTCAA 489 

,X hbc3123 EST clone TAAAACTGAC AAGAGGATAT TCACTTTCAT ATGCAAAGAT TCTGAGTCAA 89 

5»/R65a82;genbanlt . 478 

5'/AAlS9394/aentoanJc 415 

Consensus TAAAACTGAC AAGAGGATAT TCACmCAT ATGCAAAGAT TCTQAGTCAA 800 

R6S983/genbank. 429 

^00(5-6 CDNA ATAAACATTT CTGCTATGTA TTTGACAGCG AAAAGIGIGC TGAAGAGATC 840 

ArTbe PCR fragmenfc AHAAACATTT GTGCTATGTA TTIQACAGCO AAAACTSTGC TGAAGAGATC 539 

/irhfac3123 EST done ATAAACATTT GTGCI31TGTA TTTOACAGCG AAAAGTGTGC TGAAGAGATC 139 

S • /RSS882/g«nbank 478 

5 ' /AAlS9394/genfaanJc 415 

Consensus AXAAACATTT GTGCTA3PGTA TTTGACAGCS AAAACjIGTGC TGAAGAGATC 850 

RfiS983/ganbank. 429 

^GBd-« CDMA ACTTXAACAA TTGGCCAAGC ATTIGACCTO GCATACACGA AATHCTAGA 890 

ffThs pea fragment ACTTTAACAA TTGGCCAAGC ATITGACCTG GCATACACGA AATrTCTAGA 5S9 

y/hbc3l23 EST done ACTTTAACAA TTGGCCAAGC ATTlwi-CTS GCATACACGA AATTTCTAGA 189 

5'/R658a2/gttnhank r 478 

5VAAl39394/gBnban)c -t 415 

Canaensus ACTTTAACAA TTGGCCAAGC ATTTGACCTG GCATACACGA AATTJCTAGA 900 
R65983/genbank.
hced-6 cDNA
The PCR fragment
hbc3123 EST clone
5'/R65882/genbank
5'/AA159394/genbank

Consensus




Figure 28C 

hced-6 alignment Formatted Alignment



ATCRGGAGGA AAAGATGTTG AAACAAGAAA ACftGATCGCA CSGGTTACAAA 

ATCAGGAGGA AAAGATC1T3 AAACAAGAJUV ACAG 

ATCAGC3AGGA AAAGAIGTCG AAACAAOAAA. ACAGATCOCA GGCTEACAAA 



ATCAGGAGGA AAAGATGTTG AAACAAGAAA ACAGAICGCX 'GGGTXACAAA 



439 
940 
623 
239 
478 
415 

950 



K659a3/genbank. 
}f hced-€ CDMA 

The PCR fragment 
U hbc3123 EST clone 

5VR65a82/ganbank 

5' /AA159394/genbank 

Consensus 



AAAGAATCCA AGACTEAGAA ACAGAAAATA TCGAACTTAA AAATAAAOTA 



AAAGAATCCA AGACTTAGAA ACAGAAAATA TGGAACTTAA AAATAAAOTA 



AAAGAATCCA AGACTTAGAA ACAGAAAATA TGGAACTIAA AAAT&AACTA 



429 
930 
623 
389 
47 Q 
415 

1000 



RS5983/genbank. 

hced-6 cCKA 

The PCR £ragmenC 

hbc3123 BST clone 

5'/Il65882/genbanlc 

S'/AAlS9394/gentaonk 

Consanaus 



CAAGATTTGG AAAACCAACT G7M3AATAACT CAAGTMCAG CACCTCCACC 1040 
CAAGATTTOO AAAACCAACT GAGAATAACT CAAGEATCAG CACCICCAGC 339 

CAAGATrrOG AAAACCAACT GAGAATAACT CAAGTATCAG CACCTCCAGC lOSO 



RS59Q3/genbank. 
hcBd'6 cDNA 
The PCR fragment 
hbc3123 EST clone 
5'/R6SQS2/genbank: 
5 ' /AA159394/genbank 

Consensus 



429 
1090. 
623 

^ ^ 389 

AGGCAGTATG ACACCTAAGT CGCCCTCCAC TGACATCTTT GATATGATIC 1100 



AGGCAGTATG ACACCTAAGT CGCCCTCCAC TCACATOTT GATATGATIC 
AGGCAGTATG ACACCEAACT CGCCCTCCAC TGACATCTTT GATATGATIC 



R659a3 /genbank . 
hc«d-6 cDNA 
The PCR fragment 
hbc3123 EST clone 
5 * /Il65882/genl3an}c 
5 '/AA1593 94 /genbank 

Consensus 

R65 983 /genbank. 
head- 6 cONA 
The PCR fragment 
hbc3133 EST clone 
S'/RS58Sa /genbank 
5'/AA159394/genbank 

Consensus 



CATTTTCTCC AATATCACAC CAGTCTTCGA TGCCTACTCG CAATGGCACA 
CATTTTCICC AATATCACPC CAGTCTTCGA TGCCTACTCG CAATGGCACA 



439 
1140 
S23 
439 
478 
415 

IISO 

429 
1190 
623 
489 

478 

CAGCCRCCTC CAQTACCTAO TAOATCTACT GAGATTAAAC GGGACCTGTT 1200 



CATTITCTGC AATATCACAC CAGTCTTCGA TGCCTACTCG CAATGGCACA 



CAOCCACCTC CAGTACCTAG TAGATCTACT GAGATiSUAC GGGACCTGTT 



CAGCCACCTC CAGTACCTAG TAGATCTACT GAGATTAAAC GGGACCTGTT 



R659e3/genbank. 
heed- 6 cDNA 
The PCR fragroent 
hbc3123 SST clone 
5 -/R&S 882 /genbank 
5 ' /AA159394/genbank 

Consensus 



429 
1240 
623 
539 

43^3 

TGGAGCAGAA CCnTTOACC CATTTAACTG TGQAGCACCA GATTICCCTC 12S0 



TGGAGCAGAA CCTTTTGACC CATTTAACTG TGGAGCAGCA GATTTCCCTC 
TGGAGCAGAA CCTTTTGACC CATTTAACTG TGGAGCAGCA GATTTCCCTC 



R6S9B3 /genbank. 
hced-6 cDNA 
The PCR fragment 
hbc3123 EST clone 
5 VR63a82/ genbank 
5 ' /AAl59394/genbank 

Consensus 



429 
1290 
623 
589 

, 

45^5 

CAGATATTCA ATCAAAATEA GATGAGATGC AGGAGGGGTT CAAAATGGGA 130t] 



CAGATATTCA A1CAAAATTA GATGAGATGC AGGAGGGGTT CAAAATGGGA 
CAGATATTCA ATCAAAATTA OATGAGATGC AGGAGGGGTT CAAAATQGGA 



R6S983/g«nbank. 
hced-6 cOKA 
The PCR fragment 
hbc3123 RST clone 
5'/R6S6B2/genbank 
5 * /AA153394/genbank 

Consensus 



CTAACTCTTG AAGGCACAGT AXTTRSTCIC GACCCGTTAC ACAGTAGGTG 



CTRACTCTIG AAGGCACAGT ATmcnCTC GACCCGTTAG ACACTAGGTO 
CTAACTCTTQ AAGGCACAGT ATnTGTCTC GACCCGITAC ACAGTAGGTG 



429 
1340 
623 
639 
478 
415 

1350 
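
The nucleotide rows of the hced-6 cDNA can be related to the hCED-6 protein sequence by conceptual translation with the standard genetic code. The sketch below is illustrative only. The input is a stretch of the cDNA as it reads in the rows above (the block beginning CATTTTCTCC, preceded by the final C of the previous block); that reading of the extracted text, the choice of reading frame, and the names `CODON_TABLE` and `translate` are assumptions made for this example.

```python
from itertools import product

# Standard genetic code, bases ordered T, C, A, G with the third position
# varying fastest (the usual compact layout of translation table 1).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str, frame: int = 0) -> str:
    """Translate a DNA string in the given frame (0, 1 or 2).

    Whitespace is ignored, so rows printed in blocks of ten bases can be
    pasted in directly. Codons containing anything other than A, C, G or T
    are reported as "X"; stop codons are reported as "*".
    """
    seq = "".join(dna.split()).upper()
    codons = (seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


if __name__ == "__main__":
    segment = "C CATTTTCTCC AATATCACAC CAGTCTTCGA TGCCTACTCG CAATGGCACA"
    print(translate(segment))  # prints PFSPISHQSSMPTRNGT
```

Translating in the other two frames (`frame=1`, `frame=2`) is a quick way to confirm which frame gives an open reading frame without premature stop codons.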
hced-6 cDNA CTGACAKAA GAACAAGAAA TCCTGATTCA TC3TTAAATCT GmtSTAm: 1390

OliG PCR fragment 623 

hbc3123 EST Clone CTGACASCAA GAACAAGftWV, TCCTGATTCA TCmAAATCTr CJTmTAIAC 689 

5'/R65Ba2/g6nbank 478 

S'/AAl59394/genbank 415 

Conaenaua CTGACXICAA GAACAAGAAA TCCTGATTCA TGTrAAATGT GTHCTATAC 1400 

R65983/genbank. r ^ " 429 

hced-6 CDNA ACATGTCATT TATTATTATT ACTTTAAGAT AGGTATXATT CATtTTGTCAA 1440 

Ttoe PCR fragment ■ ' 6^3 

hbc3123 EST clone ACATOTCATT TATTATEATT ACTITAAGAT AGGTATTATT CATGTGTCAA 739 

5'/R658B2/genbanlc . 478 

S VAAlS9394/genbank 415 

Conaenaua ACATGTCATT TATTATTATr ACTETAAGAT AQGTATTATr CAltyiGTCAA 1450 

R65983/genbank. r 429 

hCBd-S CDMA TGrrTTTTGAA TATPTTAATA TTITGAAAAT TTTCTCAGTT AAATITCCTG 1490 

The PCR fragment 623 

hbc3123 EST clone TGTTTTTGAA TA1TTTAATA TTTTGAAAAT Tl'lCXCACTT AAATTTCCTC 789 

5'/RfiS882/genbanJc r 478 



5VAAlS9394/genbaiik 415 

Consensus TGTTTTTGAA TAlTrTAATA TTTTGAAAAT TITCTCAGTT AAATTTCCTC 1500 



R65983/garibank. — 429 

hced-6 CDNA ACCTTCACTA TTGATCTGTA ATTTTTATTr TAAAAACAGC TTACTGTAAA 1540 

The PGR fragment 623 

hbc3123 EST clone ACCTTCACTA TTGATCTGTA ATTmS^TTT TAAAAACAGC TTACTGTAAA 839 

5'/R6S882/genbanJc 478 

5'/AAl59394/genbank; 415 

Consensus ACCTTCACTA TTGATCTGTA ATTITTATTT TAAAAACAGC TTACltnAAA 15S0 

R65983/genbank. 

hced-6 CCNA GTAGATCATA CnTTATGTT CCTTICTCTT TCTACTGTAG ATOAATTIGT 1590 

The PCR fragment ' ""~ZZZI 

hbc3123 EST cXone GTAGATCATA cmTATGTr CCTTTCrC?rr TCTACTGTAG ATGAATTTGT 889 

5 ' /R65BB2/genbank 470 

S /AAl59394/geiibank r 415 

Consensus GTAGATCATA CTraTATGTT. CCnTCTGrTT TCTACTGTAG ATGAATTTGT 1600 



He5983/genb«nJc. 
hcad-6 CDMA 
The PCR fragment 
hbc3123 EST clone 
5'/R65a82;genbank 
S'/AA159394/genbauk 

consensus 



AATTGAAAGA CATATTATAC AAATACCTGC CTTGTGTCIG AGTTCTATTT 
AATTGAAAGA CATATTATAC AAATACCTGC dTGTGTCTG AGTICTATTT 

AATTGAAAGA CATATTATAC AAATACCTGC CTTGTGTCTG AGTrCTATTT 



429 
1640 
623 
939 
470 
415 

1650 



R659B3/genbank. 

hced-6 cDNA 

The PCR fragment 

hbc3123 EST clone 

5 • /R65882/genbank 

5'/AAl59394/geribank 

Consensus 



429 

AGITAGCATC TTGAAATTTG TATTCATTrr CCAGATGGCT AGnTATTAA 1690 

. 623 

AGTraCCATC TTGAAATTTG TATTCATTTT CCAGATGGCT ACTTTATEAA 989 

■ 478 

^- 415 

AQTTAGCATC TTGAAATTTG TATTCATTTT CCAGATGGCT AGTTEA*tTAA 1700 



R65983/9enbank. 
hced'6 cCWA 
The PCR fragment 
hbc3123 EST clone 
S ' /R6S882/genbank 
5 ' /AAlS9394/genbank 

consensus 



429 
1740 
623 
1039 
478 
415 

TGATTTCCCA AAAGCCATAC CTTAAAGATA ACTTriTAAA TICIQAAGAG 1750 



TGATTTCCCA AAAGCCATAC CTTAAAGATA ACTTm'AAA TTCTGAAGAG 
TGATTTCCCA AAAGCCATAC CTTAAAGATA ACmTTAAA TTCIQAAGAG 



R659a3/ganbank. 

hced-6 cDWA 

The PCR fragment 

hbc3123 EST clone 

5»/R65882/genbank 

S'/AAi59394/genbank 

Cox^enaus 



429 
1790 
623 
1089 
478 

415 

ACATGCCAAT GTCAAACTAA ACATGTTCTG TTTITAAACC AACAAACATG 1800 



ACATGOCAAT GTCAAACTAA ACATGTTCTG TTTTTAAACC AACAAACATG 
ACATGCCAAT GTCAAACTAA ACATGTTCTG TmTAAACC AACAAACATG 



Figure 28D



WO 99/37770



R659a3/ganbaiUc. 
hceJ-S CDMA 
The VCR fragment 
libc3123 EST clone 
5'/Rfi5aa2/genfcank 
S'/AAl59394/genbanlc 

Consensus 



Figure 28E 

hced-6 alignment Formatted Alignment



• 42S 
184C 
623 

iizziiii"! imz ^"^^ 

TERCTATTCA TTGGACAGAT ATCAOTTAT GTATAAATAC tCTTCACATC 1050 



TERCTAITCA TTCaACAGAT ATCATITIAT GTATAAATAC ItTTICACATC 
TEACTATTCA TTGGACAGAT ATCATTTTAT GTATAAATAC TGTICACA1C 



R659Q3/genbemlc. 
heed" 6 cUHh , 
The FCR fragmanc 
hbc3123 SST done 
5'/Re58S2/genbank 
5 > /AAlS9394/genbank 

Consensus 



42B 
L89Q 
'623 
1189 

ACTQGGAAAA TCEftAACTTT AAACATAATG OCACAAGGU: ACTAKmCT 1900 



ACTGOCSAAAA TCTAAACi'i'i' AAACATAATG CCACAACX3TC ACTAAlTICr 
ACIX3GGAAAA TGTAAACTTT AAACATAATG CCACAJM3GTG ACCAATTTCT 



R65983/genbaiiic. 
hcedrS COMA ■ 
The fCR fragment 
hbc3123 EST clone 
5'/RfiS8a2/genhank 
SVAAlS9334/gBnban3c 

Consensus 



429 
1940 
623 
1239 

AGCACGTAAA ATTATAAtSGA TATAAATTCC AATAATAAAC CAAATGTATT 1950 



AGCAGGT7\AA ATIRTAAGGA TATAAATTCC AAaiAAT&AAC CAAAIGTATT 
AGCAGGTAAA ATTATAAGGA TATAAATTCC AATAATflAAC CAAATOTATT 



ReS9B3/genbank. 
nced-6 CDNA 
The PCR fragment 
hbc3123 EST clone 
5 VR65S82/genbank 
S ' /AAl59394/genbank 

Consensus 



429 
1990. 

523 
1289 

, 



TAGACTATXT ATTAGTAAAT GCAAGGTGAT GTTAGTTATG ATCAGTTATA 2000 



TAGAOTATTT ATTACTCAAAT GCAAGGTGAT GTTAGTrATG ATCAGTTATA 
TAGAGTATTT ATTAGTAAAT GCAACCTGAT GTEAGTEATG ATCAQTTATA 



R659a3/genbank. 
head- 6 cONA 
The PCS. fragment 
hbc3123 EST clone 
S * /R6Sa82/genbank 
5 ' /AA159394/genbank 

Consensus 



CTCTAAATAT TTAATrTGlT TTATAAAGGT ACTrOAAAAAA TGAAAATTTG 
CTCTAAATAT TTAAmGTT TTATAAAGGT AQTC3AAAAAA TGAAAATTTG 

CTCTAAATAT TTAAnTGIT TTATAAAGfJT AGTGAAAAAA TGAAAATTTG 



429 
2040 

623 
1339 

478 

415 

2050 



RG59a3/genbank. 

hCBd-6 CCWA 

The PCR fragment 

hbc3123 EST' clone 

S'/R65B83/genbanJc 

5'/AM59394/genbank 

Consensus 



429 
2090 
623 

_ 1389 

; ^5 

CTATTTArnV AAAAACATTA AATTTCATTC CAAATGAGAT AAGTGATAaT 2100 



CTATTTATTA AAAAACATTA AATTTCATTC CAAATGAGAT AAGTGATATT 
CTATnATTA AAAAACATTA AATTTCATTC CAAATGAC3AT AACTTGATATT 



R659S3/genbank. 
hced-6 CDMA 
The PCR fragment 
Iibc3l23 EST clone 
5 ' /Rfi5a82/genbank 
5 ' /AA159394/genbank 

Consenstia 



429 
2140 
623 
1439 

ACTATAACAT CTAAGCATCA TCTSATTTOA TATICCCXRA AAAACATTTS 2 ISO 



ACTAXRACAT CTAAGCATCA TCTGATITCSA TATICCCTAA AAAACmiu 
ACTATAACAT CTAAGCATCA TCIGATTTGA TATTCCCTRA AAAACATTTG 



R65983/genbank. 
hcad-G CDNA 
The PCR fragment 
hbc3123 SST clone 
5 ' /R65302/genbank 
S ' /AA159394/genbaiik 

Consensus 



GAATATATGC TATCEVTAGA TTCAGTATCT ACTOCCCATA TTTACTTTAC 



GAATATATGC TATCTATAGA TTCAGTATCT ACTACCCATA TTTACmAC 
OAATATMPCSC TATCTATAGA TrCAOTATCT ACTACCCATA rSTACmM: 



429 
2190 

623 
1489 

478 

415 

2200 



R65983/genbank. 

hced-6 cGNA 

The PCR fragment 

hbc3123 EST clone 

5'/R65aa2/genbank: 

5'/AA159394/genbank 

Consensus 



CAAATATATT TCTCCICACT GCATAAGGAC TACTCTTCTC ATATmCTT 2240 
CAAATATATT TCTCCTCACT GCATAAGGAC TACTCTICTC ATATTTrCTT '1539 

CAAATATATT TCTCCTCACT GCATAAGGAC TACTCTTCTC ATATTrCClT 2250 






PCT/US99/01361



hc«d-6 cISHK 

Tha FCR Uragaient 

hbc3133 EST clone 

S'/R65882/ganbank: 

S ' /AAl59394/genbank 

Connansus 



R655a3/gsnhaitk. 
heed- 6 gDNA. 
The PGR f ragraant 
hbc3123 SST clona 
5 7K65B62/genb&nk 
5 VAAXS9394 /ganbemk 

Consenaus 



R659a3/genbanlc. 

hcad-S cON^ 

Ttia PCR fragment 

hbc3123 EST clona 

5 ' /R6S882/genbank ' 

5'/AAlS9394/ganbanIc 

Consensus 



R65933 /genbank . 
hced-o CDNA 
Tha PCR fr&gmant 
hbc3123 EST clone 
SVRSSa 82 /genbank 
S ' /AAl59394/geRbajik 

Conaensus 




Figure 28F 

hced-6 alignment Formatted Alignment



CmSAIGAA QATATTmC ACCAAAGTIT ATTTTGTQAT GCCCTCTIGG 
CmCATGAA GAaaTTTTTC ACCAAAGTIT ATITTGTGAT 




CTTTGATGAA GMWiTWiU ACCAAAGTTT ATTPICTGAT GCCdCTTGG 2300 



TTITGATACT TTAAAATCIG TGGCACCCGT TCTACATGAA TTATCAATAT 
TITTGATACT TTAAAATCTQ TGGCACCCtTT TCTACATGAA TTATCAATAT 

TmOATACT TTAAAATCTG TGGCACCCGT TClftCA3X3AA TTATCAATAT 



TTGGOaAATT CAATCTGTAT - nU ' riTltriT AAAGTCAAAA ATCTCATTTT 
nCOTAAATT GAATCTOTAT TTGTTOCSTT AAAGTCAAAA ATCTCATTTT 



CCAAAAAAAA AAAAAAAAAA AC 
CCAAAAAAAA AAAAAAAAAA AC 



429 
2340 

623 
1639 

478 

415 

. 2350 



429 
2390 

623 
1689 

476 

415 



nGGTAAATT C^ATCTCnAT ' r CGT W I ^^Tl ' AAMSfTCAAAA' ATCTCATTTT 2400 



CCAAAAAAAA AAAAAAAAAA AC 



429 
2412 

623 
1711 

478 

415 

2422 



Figure 29

Untitled-5 Formatted Alignment

hCED-6 original, hCED-6/corrected and Consensus; alignment rows ending at positions 50, 100, 150, 200, 250, 300 and 304.



Figure 30A

Untitled-9 Formatted Alignment



hced-6 cONA/coding rog. 

hcad-« cOMA/coding reg. /corxwcce 



KraA AC cgiG crrTtwxac omgmmx a&aacxhsga TOCAJACRa: 
nccsuvcctrrs crrrrAccnG csaagsuvagac aaaacatoga tgcassicaqc 

1"^^=^ ' '"' try^f^ ffAar-*!l?trtTii-^ *&axr^T%mr^ ^tv^^^^t;'^ 



SO 
SO 

50 



hced-5 cONA/ coding rag. 

hCBd-fi cDNA/ coding r«g. /correcce 

Consenaus 



gGgC AG u-X-j.-JL'A TCAAAACATT TC ACTCC CTA- liftATGCaAAg 'ITDlTlUCaCft 
'BGAACCms^ TCAAAAOLIT TCATTCCCEA TKAXGCAAAC" 



100 
100 



hcsd-6 cQMH/codixig reg. 

hcad-6 cOMA/coding reg./correcca 

Consfltnsus 



azncAowuTr GcsftACACccA, AAAGGAACiva AAcrccTGaG- AcagG cmia 

OTtCAGUOr GSAACAGCCA AAAGQAACAG A yiX ' iy i yA g/Aa M^Uyj^ 



150 
ISO 



head- 6 cOHA./ coding reg. 

hced-6 cOHA/ coding rag. /corracca 

Consensus 



JUjUTlUUAAa- ACAXAICAAG: A&AOTCCGaAa GCCAGAAAAX 
M3GAAACI3^ AGTTroCHAa ACftTaiCflAO'^KftAICTCgftgii'i^ 



200 
200 



hcad-€ eONA/ coding rag. 

hcad-6 crUA/codiiag rag. /corracce 



BQgCfcAAGTS OAG'lUlSCJUfcA: TKXCAAIT^ TSCJU?tAAAAft 
fiCGSAAACTS GAGTTGCAAAl TTOCKATTTft. ^DOOnxnSUUUfiJ^ 

TT'TTftBacm..flat7TTTTaaa TB^nraaTTTTaincfsacmaftT 'tTTT 'I'antfflc 



250 
2S0 

250 



he ad- 6 cONA/ coding reg. 

hcad-6 cDNA/ coding reg. /correcca 



CpAAAACAAA GSAASTTCAA' CACAAXTGCC. AGCXTCCXTSU: AATKXCtTrt 



SCSWAACAAA OSAAGXTCAA CACUa!IGCC.AGCTTCA33U:: AAICHTCITn 

rpTaaar^itta rimnrrrrraa raraaTnryr ui 



300 
300 

300 



hced-6 cOMA/coding rag. 

hcad-6 c£a«7 coding rag. /correcta 

Consensus 



roiGG ftGAaS ATAAAACXX^ CAAGAGGAfHW TrCA CX ' lUU g>. .TAirGGAAAGA 

xtr r ucAcaro ataaaactga..c:aagaqsaw ticacczsca^'tasocaaaga 

- ' ■ . - _ • :--^ tK'.."'. vf■■'-^ 

lI^a■■J>l^aaA^>c^3tt■ raafsafinaraii TTTTar'n'n^i TaTTf aaacsa 



350 
350 

350 



hced-6 cDKA/ coding rag. 

hced-6 CDMA/coding reg. /corracce 



IKCCTOAGTCA AAXAAACATC TGTGCXIA^GT A'XTjjJA CAGC GAAAACTXOTG 
iXfiULUACTRZA AATAAACATI! TSXGCEFVTGT AXZXGACAflC^ CAAAAGTOIC 
iTV^-nTiafTjira. ;^a'rn^;i^^pqT'f >TTJtTJr-TO'rrr)» ai'^Mviaf^^ry riffiaaB-c-jY-'v. 



400 
400 



hcad-6 CDHA./ coding reg. 

hced-6 CDMA/ coding rag. /corracce 



STCft ACAGAT Ca ClTIA ACA ATTGCSCCAAG OmTGACCT/j 
SICBCaGACAT CACXTTAACA. ATIGCCCXAS CA£L'i*iX3wiCCT.' GQCAS 



hcad-6 cDNA/ ceding rag. 

hcad-6 cJaiA/coding reg. /corracce 



KftAcTnCTAa AAXCAGGACG AAAAGIHXrrr'GAAACAAGSiU AACASATCGC 
KAATTTCXAG AATCSGGAGG ' AAAAGAIGTT GAAAJCAACA3t XftUGfkTqCC 



SOO 
500 



hcad-6 CDMA/ coding reg, 

hcod-6 cCNA/ coding rag. /corracce 



nSGGTTACAA AAAAGAAICC AAfSACTIAGA AACAGAAAAT. A3:GGAACTrA 
AGGGTTACAA. AAAAGAATCC AAGACTIAGA MCAOMAJKr^ J^ 

inrJrmn-n aaaana airr anrtar^ftrta a-araf;AagaT- g'nraanf^ 



550 

sso 

5S0 



heed- 6 cCNA/ coding reg. 

hcad-6 CCNA/ coding rag. /corracce 

Consensus 



RAAATAAAOr ACAAOATITS CAAAACCAAC IGAQVAIAAC 'SCAAJ?CIWTCA 
iMt^AfBCAAjSr ACAAGATTTS GAAAAOCAAC TX3AG»A37^VC .TpAOTJUrCA 



600 
600 

600 



hcad-6 CI3IA/ coding rag. 

hcad-6 cOeiA/ coding reg. /correcce 



SCACCTtZCAG CAOGCACiava: GACACCTAAC TCGCCCTCCA' ffTCaCATCTT 
;jpbCCTCCAG CACGCAGIKT GACACCTAAC TCGCCCtCCA. CTGACAXCTI 



650 
650 

650 



hcad-6 cDNA/ coding reg. 

hcad-6 cONA/ coding rag. /corracce 



IGATMCGATT CCAXriTCl'C CAATATCACA CCAiJlCl'X'CG ATGCCTA CTC 
XSADVXGATr CCATrXTCTC CAAIATCACA CCAGTCTTCG: ATGCCTACXC 



700 
700 



heed- 6 cC^/ coding rag. 

hcad-6 cCNA/ coding reg . /corracce 



SCAmTGGCAC acagccacct CCACTTACCXA gttagaicxac tgacatfaaa 
scaatogcac acagccacct ccaftis^ccta ctacatctac tgaoatxraa 

^ij^mr^^iif* ar^rir-ra/^p-w mnrrr3.nr^\ /yr^r-nTWaf wnf-fLW^an 



750 
750 



hced-6 COMA/ coding reg. 

hced-6 cONA/ coding rag. /corracce 

Consensus 



CGGGACCTGT TICSaAGCAGA ACCTTZTGAC CCA ma ACT CTGGACCMSC 
SOGGACCrGT TXGOAGCAGA. AOCTTTIGAC CCATTI^CTvOXGOftGCACC 



800 
800 

800 



hcad-6 cOKA/coding reg. 

hcBd-6 cCMA/ coding reg. /corracce 



CCAGATS^TTC AATCAAAATT AGATOAJCUUrG CAGGAGGGGT 
i ' l ' lU-U ' i ' CCAGAXATZC AAXCAAAATT AGATGAOASG CAGGACGGGI 
f<'**r;p.']*p.*T*pr aa»Tr'^a&a*T*r &r;*7Y;^'^-*^ .Of 'f Vr^'*-''^''''^ 



850 

aso 
aso 
Figure 30B

Untitled-9 Formatted Alignment

hced-6 cDNA/coding reg., hced-6 cDNA/coding reg./corrected and Consensus; final alignment rows, ending at position 915.



B.

ced-6(n1813); hs::hced-6, 1

ced-6(n1813); hs::hced-6, 2



1 50
car,s«nsus (SGTCATSAGC CCTTGC-G-TTC TCGCTCCGAC TGCTAAATTC GCTTGGCCGG 
Seq GGTGA7GAGC CCTTGGGTTC TCGCTCCGAC TaCTAAATTC -GCTTCSCCGG 

en(=ix7^a4 ..tgatgagc ccttgccjttc tcgctccgac xccTAAArrc ccttgggccc 

r53532 *- -GCTAAAXrC GCTTCCCCGC 

aal3 9394 GGTGATGAGC CCTTGGGTTC TCGCTCCGAC TGCTAAArTC GCTTGGCCGG 
eLA2 63tl^ . .TtSATGAGC CCTTGGGTTC TCGCTCCGAC TGCTAAATTC GCTTGGCCGG 

SI 100 

COnssr.SUS CTCCACGTTC TCGTGGCCTC ACTCGCCACA CGGATCAGAA TCCGC-AGCAC 

Seer CrCCACCTTC TCCTCGCCTC ACTCGCCAC.0^ CGGATCAGAA TCCGGAGCAG 

Chcll7^a^ CTCCACCTTC TCGTGGCCTC ACTCGCCACA CGCATCAGAA TCCGGAGCAG 

r539H2 crCCACCTTC TCGTGGCCTC ACTCGCCACA CGGATCAGAA TCCGGAGCAG 

a^I5333<l CTZCACCTTC TCGTGGCCTC ACTCGCCACA CGQATCAGAA TCCGGAGCAG 

5^*3^9710. CTCCACCTTC TCGTCGCCT=: ACTCGCCACA CGGATCAGAA TCCGGAGCAG 

la; isa 
canse.-sca ccacttctct ctattctgag cctcctgcgc c tccc ccg ctgacttccc 
sa:; ac^kCTTCTCT ctattctcag gctcctgcgg c tccc ccg ctgacttccc 

Chcll'J^ST C--ACTTCTCT CTATTCTGAG GCTCCTCCCC C. TGCCGGC . TGACTTCCC* 
GCAGTTCTCT CTATTCTGAG CCTCCTGCGC CTCCCGCC TGACTTCCC . 
aalS333-k CCACTTCTCT CTATTCTCAC CCTCCTGCCC OrTGCCMGCG TGACTTCCCG 



FIGURE 32A 



aa.3 6 37H CCASTTCTWT CTATTCTGAC CCTCCTNCGG C.TCCCCCCC TGACTTCCC



Seq 

ChcJL174a4 
rS53B2 



chcll74a-i 



151 

TG-rCTGCNCG 
TGTGTGCGGG 
TGTGTGCGGG 
rGTGTGCGGG 
TGTGTGGNGG 
TGTGTGCGGQ 

zai 

CGAT . GTTGA 



AGGGAACTCT 
AGGGAACTCT 
AGGGAACTCT 
AGGGAACTCT 

AGGGAACTCT 



■CGGCAGGCTG 
GCCCAGGCTG 
GGGCAGGCTG 
GCCCAGGCTG 
GGGCAGGCTG 
GGGCAGGCTG 



GrrrrcTTCG 

GTTrrCTTGG 

errrxcTTCG 

GTTTTCTTGG 
GTTTOTTTGG 



2QQ 

AATCrrGTTTA 
AATGTQTTTA 
AAXGTGTTTA 
AATG TGTTT A 
AATCTGTTTA 
AATCTCTTTA 



ATSGGACTTG AACAGG. .AA 



350 

SCTGGACGCT GCA.GCTGGA 



CGAT GTTGA 



CGAT. GTTGA 
CGAT. GTTGA 
CGAT . GTTGA 
CGAXraTTGA 



primer aGA104 

ATCCGACTTG AACAGG AA 
CTTG AAACGGGNAA 
ATGGGACTTG AACAGG. ,AA 
ATGCGACTTG AACACG . , AA 
ATGGGACTTG AACAGC,.AA 
ATGGGACTTG AACACG. «AA 



prirti'sr oGAias 



GCTGGACCCT 
CCCGGCCNCT 
CCTCGACGCT 
GCTGGACCCT 
GCTGGACGCT 
CCTCGACGCT 



GCA GCTCGA 
GCAAGCSrGGA 
GCA . GCTGGA 
GCA.GCTGGA 
GCA , GCTGOA 
GCA 



251 

ACTAGCGTGC 
ACTAGCGTGC 
ACTACCGTOC 
ACTAGCGTGC 
ACTAGCGTGC 
ACTAGCGTGC 



C, AACTTATT 
C AACTTATT 
CCAAGTTATT 
C . AACTTATT 
C. AACTTATT 
C . AACTTATT 



TATGATTCC. 

TATGArrCC 

TATCMICCCC 

TATGATTCC. 

TATGATTCC, 

TATGATTCC - 



ATCTGATATA 
ATCTGATATA 
ACCTGATATA 
ATCTGATATA 
ATCTGATATA 
ATCTG2rCATA 



2QC 

CATACCAGAC 
CATAGGAGAG 
CATCCGAGAG 
CATAGGAG3jG 
CATAGGAGAG 
CATAGGAGAG 



GGAXQZ 
r3S3S3=rcc 
Chcl.l743-i 



3GL 

AAACT GATA GAAGAATTGT GATC3GCAACT 



350 

GTATGATAG AAGCTAT AT 



CT GATA GAAGAATTCT 



primgr 445 *02F 
priaar oGAXa2 

GATGGCAACT GTATGATAG AAGCTAT AT 
- -TA 



AAACT. GATA GAAGAATTCT GATGGCAACT 
AAACT. GATA GAAGAATTCT CAtGGCAACT 
AAACT. GATA GAAGAATTCT CATCCCAACT 
AAACTTGATA GAAGAATTCT GATCCCAACT 



.GTATGATAG AACCTAT.AT 
.GXATGATAC; AAGCTAT, AT 
.CTATGATAC AAGCTAT. AT 
.GTATGATAG AAGCTAT, AT 



aGA10 3 

thcll74a4 
rff39S2 

aai3333-; 



cGAi32 

rfiS932 
aaI59394 



3S1 

AAAGTCAAGT 



AAACTCAACT 
CAAGTCA-GT 
AAAGTCAAGT 
AAAGTCAAGT 
AAAGTCAAGT 
AAAGTC.3wa.GT 



GTCCATTTTC 



GTCCATTTTC 
GTCCATTTTC 
GTCCATTTTC 
GTCCATTTTC 
GTCCATTTTC 
GTCCATTTTC 



T^-AGTCGTCC AACTGAACAT 

TAAGTCGTCG AACTGAACAT 
TAACTCCTGG AACTGAACAT 
TAAGTCGTCG AACTGAACAT 
TAACTCCTCC AACTGAACAT 
TAAGTCGTCG AACTGAACAT 
tAAGTCGTCC AACTGAACAT 



TTTCAACTAT 
TTTCAACTAT 
TTTCAACTAT 
TTTCAACTAT 
TTTCAACTAT 
TTTCAACTAT 
TTTCAACTAT 



ATTTGAGCAT 
ATTTGAGCAT 
ATTTGAGCAT 
ATTTGAGCAT 
ATTTGAGCAT 
ATTTGAGCAT 
ATTTGAGCAT 



4C0 

ACCCACGATT 
ACCCAGG ATT 
ACCCACCiATT 
ACCCACGATT 
ACCCAGGGTT 
ACCCAGGGTT 
ACCCACGATT 



4.S0 

TTATTTCC CT GATCCTCATC ATG . AACC CT 

Primer 44S -X093-; ^O? -jt 
TTArrrCGCT catcctcatc atg aaccgt 

TTATTTCCCT GATCCTCATC ATG. AACCGT 
TTATTTGGCT GATCCTCATC ATG. AACCGT 
TTATTTGGCT CATCCTCATC ATGGAACCGT 

TTArrrCGCT gatcctcatc atggaaccgt 

TAT 



CED-6 



MAXCIYXTFK RSVSCrVCGW JfI?JGSGSS5? STSABQVKT^'R GGTG - 



FIGURE 32B 






csn-s 
hcro-fi 



seq 



A 
431 
GCT 

CCT 
OCT 

<2c: 

OCT 




R T W 
R K iC C K T 



TTTAGCA GGAAGAAAGA 

rrrAGCA cgaagaaaoa 

TTTAGCA QGAACAAACA 

■rrrACCA ggaagaaaga 

mASCA COAACAAACA 

CA gcaacaaag::. 



CAAAACATCC 
CAAAACATCG 
CAAAi^CATCC 
CAAAACATGG 
CAWkACATGS 
CAAAACATGG 



I K P 
M K C 

ATCCATACAC 
ATCCATACAC 
ArsCATACAC 
ATCCATACAC 
CTGCTSIACAC 
GTGCTJTACAC 



P - I L 

P E A t. 

500 

cTGAACcrrr 

CTGAAGCTTT 
CTGAAGCTTT 
CrCAAGCTXr 
CTCAAG.NTr 

CTGAAG.xrrr 





I K G 


K V E 




5 X H 


r I F 




5wl 




cans ensus 


ATCAAAACAT 


TTCATTCCCT 


Seq 


ATCAAAACAT 


TTCATTCCCT 


OC3AXa2 


ATCAAAACAT 


TTCATTCCCT 




ArCAAAACAT 


TTCATTCCCT 




ATCAAAAC.W 


TTCTi'TCCNA 




ATCAAAAC-tT 


TTCTTTCCSIA 




r p X A 


>r c s 




%• s sa p 


IC C T 




SSI 








AAAAGGAACA 




TGGAACAGCC 


AAAACGAACA 


QGA102 


XGGAACAGCC 


AAAAGCAACA 






AAAACGAACA 




arc 


X D t* X 




iC 7 A 


K K I K 



V A 
W A 



X 



G 

a 



c 
s 



V 

T 



£ 
550 

AGTACAcSAAiS 
AGTAjCAGAAG 
ACTACAGAAG 
AOTACAGAAG 



V 
V 



R D 



I 

V 



A I 

:< L 

tfaa 



IGTGA GACATGCTGT AAGGAJKACTA 



s 

c 



QTSIXTAX 

G C K 



Q 
P 



consensus 
aGA:.C3 



ICAA GACATATCAA GAAATCTGAA GGCCAGAAAA TT 



2C1 

AAjG«i.«4wwr% » — - 

A,sC3TTTaCAA GACATATCAA CAAATCTGAA CGwCfWAAAA 

AACTTTGCAA GACATATCAA GAAILTCTGAA GGCCAGAAAA 

AACTTTGCAA GACATOTCAA GAAATCTGAA GGCCAAAAAA 

.ACTTTGCAA GACATATCAA GAAATCTGAA GGCCAGAAAA 



K V 
tfSC 

rrCCTAAAGT 
TCCTAAAGT 
TTCCTAAAGT 
AAAAAAAAG , 
TTCCTAAAG- 



hCED-5 



CSD-tf 



CSO-fi 
hCSO-€ 



conscixsus 
Set; 
OGAIOI 



OCA1Q2 
dS27 3 * 



GGAG-TTGCAA ATATCAATTT ATGGAGTAAA ^rS^^A^A 

CGAGTTGCAA ATATCAA-TTT ATGGAGTAAA AATTCTACAA «CA^^ 

GGAGTTGCAA ATATCAATTT ATGGAGTAAA AATTCTACAA C-^AAA^CA-. 

C-GA.GTTGCAA ATATCAATTT , ATGCACTAAA AATTCTACAA CCC 



X 



A 



I 



K 



^:^7^f yt? pl 

KEVG Wr^C GC- 
AGCAACTTCA ACACAATTCC CAGC 



c 



s 

s 



c 
c 



7CATA gaatatct: 



A D 
A C 
7SC 

TTGTGCAGAT 



^^AAGTTCA ACACAATTCC CAGCTTO.TA GAATATCTTT TTGTGCAGA;. 
CAATTCC CAGCTTCATAGSi;^ArATCTTOac.vaTGCAOAT 



z :< 

Z K 
7! ; 



c 



2C 

:< 



r s 

7 X 



A 

C 



HAS C 

K C S 



\, s 
aoQ 



AT.^A.iJ^CTC ACAAGACGAT ATTCACTTTC ATATGCAAAG ^^^^^^^^ 
ATAAAACTC ACAACAGCAT ATTCACTTTC ATATGCAA^C A 



FIGURE 32C
CED-6



hCED-6




da 27 a 7 CATAAAACXO XCAAGACCAT ATTCACTTTC ATATGCAAAG 



ATTCTGACTC 



E 



X I. 



A. 
A 



C X V SCV AyTS 
KKK Ley VFDS 

cans eti^u* AAATJJXJXCAr TTGTCCTATG TATTTGACAG CGAAAAGTGT GCTGAAGAGA 
Seq 

aGA102 , , 



D 
830 



CTOAAGAGA 

* ^ CTGAAGAGA 

r7637a AAATAAACAT TXGTGCTATG TAXTTGACAG CGAAAACTCT GTTAACTAXCC 

aa3a79a2 

da27S7 



• -GT GCTGAAGAGA 

ATAAACAT TTGTGCTATG TAXTTGACAG CGKAAAAGTGTGCTCAACACLa. 



CEO-ff 



I T 
331 



T 



I 



c 
G 



Q 



A 
A 



7 
F 



t. A Y 
LAY 



X 
T 



pCAIOl 

QGAia2 
r7«'37 8 

aa3a7932 
dS27a7 



eona«zisus 



TCACTTTAAC AATTGGCCAA GCATTTGA. . CCTCGCATAC 



TCA CTTT AAC AATTGGCCAA GCArXTGA 
TCACTTTAAC AATTGGCCAA GCATTTGA., 
CACATGTTGT AGGGCTGGTT TGTTCTCTTT 
TCACTTTAAC AATTGGCCAA GCATTTOA- . 
TCACTTTAAC AATTC3CCCAA GCATTTaKN 



CCTGGCATAC 
CCTGGCATAC 
TAXAAGjrCC 
CCTGGCATAC 
CTGGCATAC 



y 

F 
300 

AGGAAATTTC 
TC 

AGGAAATTTC 
AGGAAATTTC 
GCGOATTqTC 
AGQAAATTTC 
AGGAAATTTC 



L t} 
L E 

TAGAA.TCAC 



3t 
3 



G G 



n 



L 

V 



T 



Q 



X 



CACGAAJSJLGA TCTTtSAAJUTa. AGAAAA- 



Q I V I 
C 1 A C 

-C AGATCSCACa 



oGAlQl 
Sac; 
aGA102 
aa3 07332 
da27a7 



pGAiai 

OCA107 
OCAX02 

r7tf37a 

aa307S32 
(432737 



TCAG 
TA3AA TCAG 
TAGAA.TCAG 
TAGAA,TCAG 
T:JGf=^'\.TCAG 



GAG/3AAAACA 
GACGAAAAGA 
GAGGAAAAGA 
GAGGAAAAGA 
GAGGAAAAGA 



Prime J cGAia7-y 
Primer 445-10934-04 

Priaier 445-10334 -oa-R 
TGTTGAAACA AGAAAA C AGATCGCAGG 
TGTTGAAACA AGAAAA 
TGTTGAAACA AGAAAA. 
TGTTGAAACA AGAAAA. 
TGTTGAAACA AGAAAA, . 



C AGATCGCAGG 
.C AGATCGCAGG 
. C AGATCGCAGG 
.C AGATCGCAGG 



K 



K 
R 



X V 



GTTACAAAAA 
GTTACAAAAA 
STTACAAAAA 

GTTACAAAAA 

CTTG 

GTTACAAAAA 
GTTACAAAAA 



ELET EWQ VLI 
DtfiT ^ y u 

1000 

ACTXAGAAAC AG.AAAXTAT GGAACTTAAA 
ACTTAGAAAC AG AAAATAT GGAACTTAAA 
ACTTAGAAAC AG AAAATAT GGAACTTAAA 

AAA 

AGAATCCAAG ACTTAGAAAC AGGAAAATAT GGAACTTAAA 

AGAATCCAAG ACTTAGAAAC ACAAAATAT GGAACTTAAA 
AGACTCCANG ACTTAGAAAC AC AAAATAT GGT 



AGAATCCAAG 
ACAATCCAAG 
AGAATCCAAG 



R 
V 



pGAXSl 

OGAI 0 T 
OGA10 3 
aA307932 



X E 
^ X 

AATAA.=lGTAC 
AATAAACTAC 
AATAAAGTAC 
AATA.VACTAC 
AATAAAGTAC 
AATAAAGTAC 



A £ 



A L 
E ^ 



R A 



R 



K 
T 



A . AGATTTGG 
A AGATTTGG 
A AGATTTGG 
A ACATTTCC 
A. ACATTTGO 
A . ACATTTGG 



AAA-\CCAACT 
AAAACCAACT 
AAAACCAACT 
AAAACCAACT 
AAAACCAACT 
AAaACCAACT 



CAGAATAACT 
GAGAATAACT 
CAGAATAACT 
GAGAATAACT 
GA^rAATAACT 
GAGAATAa.CT 



A, O ^ 
Q V S 

1050 
CAACTATCAG 
CAAGTATCAG 
CAACTATCAG 
CAAGTATCAG 
CAAGTATCAG 
CAAGTATCAG 



2 K 
A P 

1051



T 
P 



G 
A 



P 

G 



P 
S 



1 
M 



T ? 



G 
K 



s 



G 



P 

c 



P A 
T -0 
1100 
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CED-6 



CED-6



hCED-6



CED-6
hCED-6



CED-6






COrt^«ti*UJP CACCTCCACC 

pGAia 1 CJ^CXCCAGC 

sect C^rrCTCCACC 

CGAI Q *7 CACtrXCCAGC 

OGAl 0 2 CACCTCCAGC 

aA3 0 73 a 2 CACCTCCAGC 

Mt*S?l£ P Q S 
I P D 

ATCTTTGATA 

ATCTTTGATA 
1 ILSATA 
A-^iili'G, AT 



ACC-CA. 
AG£3 CA 
AGG CA 
AGG CA 
AGCGCAA 
ACGCA. . 



.CT ATGACACCTA 
CT ATGACACCTA 
C3T ATGACACCTA 
GT ATGACACCTA 

.err ATGACAccrr 

.GT ATGACACCTA 



AG, .TCGCCC 
AG TCGCCC 
AC TCGCCC 
AG TCGCCC 
AASTTCGCCC 
AG. .TCGCCC 



pGAlOl 
Seq

oGAia7 

OGA102 
aa3079a2 



R 
T 



^iGAlOl. 

Seq
cGAiO"? 
dGAia2 



pGJUOl 

cGAiO'? 
OGA102 



K 
H 

GATGCCTAC . 



GATGCCTAC 

CATGCCTAC 

GATGCCTAC 

SATGCCTTCC 

GATGCCTAC - 

GATGCCTAC. 



r L F <a 
?. S T E 

CATCTACTGA 
CATCTACTGA 
GATCTACTGA 
CAiTCTACTGA 

1X;1JTG 

GATCTJUZTGA 
ATCAAAGCAT 



P P P 

M 1 P 

, TCATTCCAT 
TGATTCCAT 
TGiATTCCAr 
TGATTCCAT 
ATCATTCCCT 
ATGATTCCAT 
CCAT 

A M 

R N 

, TCCCAAT . . 

TCCCAAT 
-TCCCAAT 
TCCCAAT 

TTCGCAATTG 

T . CGCAAT . . 

TCGCAAT 

.CAGCAAGTC 



F 



P 
I 



5 5 

S H 



TTTC, ,TCCA 
TTTC TCCA 
TTTC TCCA 
TTTC TCCA 
TTTTCTXCCA 
XTTCT. . .CC 
TTTCT CC 

13 L P 

G T a 



ATAT-CACaC 
ATAT CACAC 
ATAT CACAC 
ATAT CACAC 
ATATTCACAC 
AATATCACAC 
AATATCACA 



P 

P 



P 



S 
P 



GGCACACAGC C . AC CTC ■ CA 



TCCACT.GAC 
TCCACT CAC 
TCCACr GAC 
TCCACT GAC 
TCCACXTGAC 
TCCACT, GAC 

X Y S 
Q S S 

1150

C.AGTCTTC- 
C AGTCTTC 
C AGTCTTC 
C AGTCTTC 
CCAGTATTOT 
C. AGTCTTC. 
CGAGTCTTC, 

•MAP 
VPS 

1200 
GTACCTAGTA 



CGCACACAGC 

GGCACACAGC 
GNACCACAGC 
GGCACACAGC 
GGCACACAGC 
AACATTTGAC 



C ACCTC CA 
C AjCCTC CA 
C ACCTC CA 
CCACCTTWCA 
C. ACCTC. CA 
C ACCTC CA 
ATATAGTTAT 



GTACCTACTA 
GTACCTAGTA 
GTACCTAGTA 
GTTCCTTAGT 
GT ACCTAG , - 
GTACCTAGTA 
TTATTAGTTG 



5 



2 
D 



? 



V s 
D P 
1250



GATTiU^ACGG GACCTGTTTG GAGCAGAACC TTTTGACC^ 
CA-^AAACGG GACCTCTTTG CI^GCACAACC TTTT GACCCA 
GATTAAACGG GACCrGTTTG GAGCAGAACC TTTTGACCCA 
GATTAAA2JCC GACCTCrTTG GAGCAGAACC TTTTGACCCA 

okxTJ^CGO GACCTGTTTG GAGCRGAACC TTTrGAC CCA 
GAATATTTCA ACTTTACTGT TCACTGATTT TATITXGCTG 



pGAiai 

Seq

cCAia? 

aa J. 3 X 9 9 s TAACATTT 



?AS TSP5 GPA ySX 

?NC GAAD 5FP CIQ 
1251 

•^S AgTgTG GAGCAGCAGA TTTCCCTCC A GATATTCAAT 

rrTAACTCTG GAGCAGCAQA TTTCCCTCCA 

TTTAACTCTa OAGCAGCAGA TTTCCCTCCA 

TTTAACTGTG GAGCACCACA TTTCCCTCCA 

TAACTCTC CACCAGCACA rtTCCCTCCA 

^A CA 



P P P A 

S K L D- 
1300 
Q!^AAATTAjfiA 



CATACTC TTTTT: 



GATATTCAAT 
GATATTCAAT 
GATATTCAAT 
CATATTCAAT 
GATATTAATT 



CAAAATTAGA 
CAvrpATTAGA 
CAAAATTAGA 
CAAAATTAGA 
ATTTTATTCT 



T 



S 
Q 



9 

E 



G 
F 



A 
M 



P 

G 



R P 
T V 

MCI ^^^^ 

consensus tcIcatgcag qaggggttca aaatgccact ^CT*=3^,°25t^^^J^? 

PCAIOI TCACATGCAG GAGCGGT7CAAAAATGGGACT ^EI^S^CaSt 

S«q TGAGATGCAG GAGGQGTTCA AAATGC-GACT AACTCTTCAA ^^^^ff?!^ 

oCAiar TCACATGCAG GACGGGTTCA AAATGCCACT AACTCTTGAA ^-CACA^ 
cG^MOa C QACCGGTTCA AAATCGCACT AACTCTTCAA CwCACACnr^ 
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3.335 



-CA'-ATCCAC CAGCGGTTCA AA^kTGGGACT i»J\CrCXTGAA CGCACAGTAT 
GAGGGCTTCA AAATCCCACT AACTCTTOAA CGOCAQTAT. 



CED-6



- C 
1351 



V 



P 



P V A 
SRC 



cotiseasus TrnrarcrcQA ccccrrrAaAC 

scq rrTGTsrrcQA cccgttacac 

oaAlt3 7 rTTGTCTCCA CCCCTTAflAC 

oGAiaS TTTGTCTCCA CCCGCTP^C 

aa4,.43 3Sa TTTSTCrCtSA CCCCrTAGAC 

aa4 3 3.d9 3 TTTOTCTCCA CCCcrrtAGAC 



AGTAC CTGCt GACAXCAACA 

primer 445*10934 



1400 
ACAAGAAATC 



AGTACGTOCT GAjCATCAAGA 
ACTAGCTGCT OACATCAAGA 
AGTAGGTGCT GACATCAACSA 
AGTAGGTGCT GACATCAAGA 
AGTAGGTGCr GACATCAAGA 



ACAAGAAATC 
ACAAGAAATC 
ACAAGAAATC 
ACAAQAAATG 
ACAAGAAATC 



SSSS ISSSJ^ ^i30S5E?Ga MTOOLfi«S SSSOVOTPSD R^VJX«««.KO 



oCAXa? 
oGAias 

oGAXOa 
Aa4433*3 
aA4 313S3 



OGA107 
oOAXca 
o<3AX3 3 
SUl443a«3 
aa43X53S 

r33333 



QGAXaS 

aCAXoa 

a&4431£a 
aa43X333 

r333 3 5 



1401

CTGATTCATG 



^r^^auTfl'TSr rrGTATAC.A CATCTCATTT 



primer aCMQ^"? 
Sq<5 ctgattcatc ttaaatgtct 

TTSwXATGITGT 



CTGATTCAr:? 



CTGATTCATC rrJUiATGTGT 
CTGATTCZ-TG TTAAATGTGT 
CrCATTCATG TTAAAXCTGT 
r3 3 3a9 CTC ATTCATG TTAAATGTGT 



TTGTATAC A 
TTGTATAC A 

TTGTATAC A 
TTGTATACA 
TT3TATAC.A 
TTGXATACA 



CATGTCATTT 
CATGTCATTT 
ACTGTTCAXT 
CATGTCATTT 
CATGTCATTT 
CATGTCATTT 
CATGTCATTT 



1450
ATTATTATTA 

ATTATTATTA 
ATTATTATTA 
ATTATTATT 
ATTATTATTA 

ArrATTArxA 

ATTATTATTA 
ATTATTATTA 



1451 

CTTTAAGATA 

CTTTAAGATA 

CTTTAAGATA 

CTTTAilNAA 

CTTTAACATiSf 

CTTTAAGATA 

CTTTAAGATA 



GGTATtA.r: 
GGTATTA XT 
GGTATTA TT 
QGTTATTATT 
GGTATTA TT 
GGTATTA TT 
GGTATTA TT 



GcrrATTA.TT 
GGTATTA. TT 
GGTATTA. TT 



AAGATA 

. . , TAACATA 
CTTTAAGATA 

1501

TTTTGAAAAT 
TTTTGAAAAT 
TTT^nrAAAAT 
TTTTGAAAAT 
TTTTGAAAAT 
TTTTGAAA.\T 
TTTTGAAAAT 
TTTTGA*VAAT 
TTTTGAAAAT 
TTT TCA-XAAT 



1551

CTCTAATTTT TATTITAAAA ACAGCTTACT_£^ 



1500

CATGTGTCAA TCTTTtTGAA TATTTTAATA 
CATGTGTCAA TCTTTTTGAA TATTTTAATA 
CATGTGTCAATGGTTTTTTGAATATTTTAATA 
KTGC3OTCA OrrrTTMTAA TATTTTAATA 
CATGTGTCAA TCTTTTTGAA TATTTTAATA 
CATGTGTCAA TGTTTTTGAA TATTTTAATA 
C3.TGTGTCAA TGTTTTTGAA TATTTTAATA 

_ ,GATA 

TTTGAA TATTTTAATA 



CATGTGTCAA TGT; 
CATGTGTCAA TGTTTTTGAA T ATTTTA ATA 
CATGTGTCAA TGTTTTTGAA TATTTTAATA 



1550
CACTATTGAT 
CACTATTOAT 
CACTATT^IWAX 
CACTATTGAT 
CACTATTGAT 
CACTATTGAT 
CACTATTGAT 
CACTATTGAT 
CACTATTGAT 
CACTATTGAT 



TTTCTCAGTT 


AAATTTCCT. 






TTTCTCAOTT 


AAATTTCCT 


CACCT 


T 


rrrcTCAWTT 


AAATTTCCT 


CACCT 




TTTCTCAGTT 


AAATTTCCT 


CACCT 


T 


TTTCTCAGTT 


AAATTTCCT 


CACCT 


T 


TTTCTCAGTT 


AAATTTCCT 


CACCT 


T 


TTTCTCAGTT 




CACCT. . 


. -T 


TTTCTCAGTT 


AAATTTCCT . 


CACCT. . 




TTTCTCAGTT 


XULTTTCCT . 


CACCT . . 




TTTCTCAGTT 


AA^.TTTCCT. 


CACCT . . 





.TAAAGT AC A. 



1600
■ TCATA 




FIGURE 32F
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r3338S 



consensus

rsiaai 
r333e3 



GGA10 9 

aa43199S 
753331 

1103743 
ff3 333 9 



Seq
O0A13S 

■ cGAiLQ 

rS3331 
rS223S 

r333S9 



rS323 S 



Seq
oCAilO 
r53aaL 

r^223*5 



CTcrAATTrr 

CTGTXATTTT 

CTCTAA rTTT 
CT3TAATXTT 
CToTAATTTT 





TATTTTAAAA 
TATTTTAXAA 
TATTTTAAAA 
TATTTTAAAA 
TATrCTAAAA 



T-rccTTTcrrG 
TTCcrrrcTC 

rrccrrTCTG 

TTCCTTTCTG 
TTCCTTTCTG 
TTCCTTTCTG 
TTCCTTTCTG 
TTCCTTTCTG 



PCT/US99/01361 



JlCAGCTTACT gt 

ACACCTTACT C TAAAGT AC A TCATA 
ACAGCTXACT G. , .TAAAGT AG.A.XCAXA 

ACACcrrAcr g..,taaagt ag.a.tcata 

ACAGCTXACT G . . - TAPAGT AG . A . TCATA 
ACACCTTACT G . . . TAAAGT AGGA , TCATA 



TTTCTACTGT 

TTTCTACTGT 

TTTCTACCC 

TTTCTACTGT 

TTTCTACTGT 

rrrcTACTGT 

TTTCTACTGT 
TTTCTACTGT 
TTTCTACTGT 



AG AT . . CAAT 
AGAT GAAT 
TNKA OTAA 
AGAT CAAT 
AGAT GAP.T 
AGAT. .GAAT 
AGAT.'.CSAAT 
AGAT. .GAAT 
AGGATGOAAT 



1531 

AAG . ACATAT 

A^C- ACATAT 
A.*rr ACATAT 
AAG ACATAT 
AAG ACATAT 
AAC, ACATAT 
AJ^G. ACATAT 
AJ!i3,ACA,TAT 
AAGGACATAT 

AGCATCTTO 

AGC ATCTTG 
,^Tr ATCTO-r 
AGr ATCTTG 

AGC. ATCTTG 
AGCAtCTTC 
AGC. ATCTTG 
GC-CCATCTGG 



^^^r.f^^Ji.^ ^rX eCTGCCTTC T GTCTGAG . TT 

tarimer 445-10934 *0g -7 
TATACAAATA CCtCCCTTGT GTCTGAG TT 
TATACAAATA GCCACCTTAilGATCDDmTT 
TATACA^ATA CCTGCCTTCT GTCTGAG TT 
TATACAAATA CCTGCCTTGT GTCTGAG TT 
TATACAAa^TA CCTGCCTTGT GTCTGAG. TT 
TATACAAATA C-CTGCCTTGT GTCTGAG. TT 
TATACAAATA CCTGCCTTGT GTCTGAG. TT 
TATACAAATA CCTGCCTTGT GTCTGAGGTT 



1650
TTCTAATTGA 
TTGTAATTGA 
TTGKTAATTATA 
TTGTAATTGA 
TTGTAATTGA 
TTGTAATTGA 
TTGTAATTGA 
TTGTPlATTGA 
TTGTAATTGG 

1700 
CTAi' i TAOTT 

CTATTTAGTT 
CTATTTATTT 
CTATTTAGTT 
C 

CTArTTAGTT 
CTAtCTAGTT 
crATTTAGTT 
CTATTAGGTA 



i^AATTT GTA*: 



Pr 

KAATTTGTAT 
AAATTGATAT 
AAATTTCTAT 

AAATTTGTAT 
AAATTTGTAr 
AAATTTGTAT 
AAATTTGTAT 



1750

TCATTtTCC\ gATGGCTAGT TTATTAATGA 
liwr OGAIIQ-F 

TCATTTTCCACGATGGCTAGT TTATTAATGA 
TCAtTTTCCA TACGSrCTCTTTTATTAAGaiAT 
TCATTTTCCA GATGGCTAGT TTATTAATGA 

CTTTAATGA 
^GATTTTCCA GATGGCTAGT TTATTAATGA 
TCATTTTCCA GATGGCTAGT TTATTAATGA 
TCATTTTCCA GATGGCTAGT TTATTAATGA 
TCATT. . , 



1751

TTTCCCAAAA CCCATACCTT AAAG.ATAAC 

TTTCCCAAAA GCCATACCTT AAAG ATAAC 
TTCCCAAAW TCCATACCTT A^T ATAAC 
TTTCCCAAAA GCCATACCTT AAAG ATAAC 
TTTCCCAAAA GCCATACCTT A^^G ^T.^AC 
TTTCCCAAAA GCCATACCTT ^AAC . AT-iAC 
TTTCCCAAAA GCCATACCTT AAAC- ATAAC 
TTTCCCAAAA GCCATACCTT ;iAAGCATiAC 



TTTTTAAATT 
primer 445



CTGAAGA. .q 
-10 9 34-X2-R 



TTTTTAAATT 
TTTTTAAATT 
TTTTTAAATT 
TTTTTAAATT 
TTTTTAAATT 
TTTTTAJLATT 
TTTTTAAATT 



ctcaaact; 



ACATGCCAAT GTCAAACTAA 

ACA:rrccAA gttcaaactaa 

ACATGCCAAT GTCAAACTAA 
ACATGCCAAT GTCAAACTAA 
ACATGCCAAT GTCAAACTAA 
ACATGCCAAT GTCAAACTAA 



ACATGTTCTG 
ACATGTTCTG 
AC.s^rTTCCJJ 
ACATGTT CTG 
ACATGTTCTO 
ACATGTTCTG 
rCTG 



TTTTTAAA, 
TTTTTAAA 
TTTTTAAA. 

rrrTTAAA 




CTGAAGA G 
TNTAATA T 
CTGAAGA G 
CTGAAGA G 
CTGAACG. .C 
CTGGAAGA.G 
CTCGAAGGNG 

1850

CAACAAACAT 
CAACAAACAT 
caacaaaca:? 

CAACAAACAT 
CAACAAACAT 
CAACAAACAT 



rAAAAC CAACJULACAT 



FIGURE 32G
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na37^9 ACA-GCCIAAT GTCAAACtAA 'ACATCTrCCG TTTTTRAAAC CAACAAACAT 



consensus 
Seq 
OGA10 3 

OGAIIO 



Seq
oCAlLO 



GT-rA.CTATT 
OTTA CTAXT 
NTTA '-CtATT 
CTTA CTATT 
GTTA CTATT 
gtta.ctatt: 
cttaactatt 

CTTA. CTATT 
1901

CACATCACTCJ 
CACATCACTG 
TCACATCACTG 
CACATCACTG 
CACCTCACCG 



CATTGCACA 
CATTGG ACA 
CATGNGNACA 
CATTOG ACA 
CATTGG ^CA 
CATTGGCACA 
TCATGGGACA 
TCATC 



GATATCATTT 
GATATCATTT 

KATATCArrr 

CSATATCATTN 
GATATCATTT 

GOTATCcrrr 



TATG. .TATA 
TATG TATA 
TANA TATA 
HATG TATA 
TATG TATA 
TATCCTATTA 



1900
AATACTGTT. 
jkATACTGTT 
AACACTANT 
AAT 

AATACTGTT 

aatactgttc 



G-GAAAATGT 
G GAAAATGT 
GOTAAAAGAT 
G CAAAATQT 
aGGGNATtSGT 



1950

AAACTXT.AA ACAXAATGCC ACAAOGXCAC 

AAAcrrr aa acataatqcc acaa gct cac 

AACrCTTT AA ACATAATACCCACA2rGTTCAC 
AA^CtTT AA ACATAATCCC ACAAGCTCAC 
AAACCrnJAA ACCTNATGGC CNCAGGGGCA 



1951



oCAXOSl 

cGAiia 
aaA317S3rcc 



consensus 



pGAlOl 
Seq
oGAia 9 
oGAllQ 



TAATTTCTAG CAGGTAAAAT TATAACGATA TAAATTCCA^ 
TW.TTTCTAG CAGCTAAAAT TAT.XiGGATA TAAATTCCAA 
TAATTTCTW. OrOATSTAAAT TATA^TGGNTATAAAATTCCA^ 
TAA-^CTAC CAGGTAAAAT TATAAGGATA TAAATTCCAA 

GGTAAAAT TATAAC-GATA TAAATTCCAA. 

CCXTTTTyCG GCG 



2000
TAATAAACCA 
TAATAAACCA 
TAATAAACCCA 
TAATAAACCA 
TAATAAACCA 



primer 445 -2-023 4 -IQ-? 

AGTAAACGCCAAGGTGATGGTXACTTAACGAT 
AATGTATTTA GAflTAXTTAT lAGTAAATGC AAGGTCATGT TAGTTATgAT 
.ACATATTTAAGAA-ATTTATrrAJlTAiCtGC =^5"^^ 



2050 



r;r^-V-TTi' aicTATTTA-r TACTJUATSC AAGGTGATGT TACTTTATGAT 
.«OX7Sar== :^4S;S^4?A ^^SSir T^GTAAATGC AAGGT=ATGT TAGXTAXGAX 
aal3S297rcc ?' 

2100 

«aaaggtag tgaaaaaatg 

ATAAAGCTAG GAAfVAAATG 
ATAAAGGTAC TGAAAAAATG 
ATAAAGGTAG TGAAAAAATG 
ATAAAGGTAG TGAAAAAATG 
ATAAAGGTAG TGAAAAAATG 
ATAAAGGTAG TGAAAAAATG 
ATAACCGTAC S*CAAAAAA£rG 



2051

.w.*-^^-— CAGTTATACT CTAAATATTr AATTT GTTTT 
pCAiaZ CAC-GTTAAAACCrCTAAAATATT^rPATnJTTTGTTTT 
Sei? CA3TTATACT CTAAATATTT AATTTGrTTT 
CAGTTATACT CTAAATATTT AATTTGTTTT 
CAGTTATACT CTAAATATTT AATTTGTTTT 
CAGTTATACT CNAAATATTJ? AATTTGTim: 
TATACT CTAAATATTT AATTTGTTTT 



OGAilO 
Aa77a22arc=: 



pGilCL 

OGAIIC 

aat53257rc:: 
*a77a22a rc:; 
ha2a33rcc 

r52l3Sr=c 



2101

AjQ^TTTCCT X 



TTCCT ATTTATTAAA AAACATTAAA 



TTTC^ATTCC 



2150
. AA-^TGAGAT 



AAJLJ^rTTCCT iTTTATTAAA AAACATTAAP. 
AAAATTTGCT ATTTArXAAA AAACATTAAA 
AAAATTTCCT ATTTATTAAA AAACATTAAA 
i^AAATTTGCT ATTTATTAAA AAACATTAAA 
AAAATTTGCT ATTTATTAAA AAACATTAAA 
AAAATTGGCT ATTTATTAAA AAACATTGAA 
AAAATTWGCT ATTTATTAAA AAACATTAAA 

ATTAT:<a-=u!w AAACATTAA,^. 

AAACATTAAA 



TTTC ATTCC 
TTTC ATTCC 
TTTC ATTCC 
TTTC . ATTCC 
TTTC . ATTCC 
. TTTC . ATTCC 
TTTC . ATTCC 
TpCTC.ATBCS 
TGTCCAi^fGCC 



AAATGAGAl 
AAATGAGAT 
AAATGAGAT 
. AAATGAGAT 
. AAATGAGAT 
. A-^ATGAGAT 
CA.2u\TGAGAT 
. AJiATCACAr 
CAAATGAGAT 
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pGAlOl 

oGAllO 

aal53237rcc 
aia770729rcc 

dffoaisrcc 
rs2i3Srcc 



aa77a22arcc 
fi02a53rcc 
dSOaiSrcc 



consensus

pGAlQl 

Seq 

aa431733rcc 

aa770223rcc 
h.aaas3rc<: 

rS213Srcc 



consensus

Seq
oCAlia 
aa^317S337CC 
aAlSS237rcc 
aJl77a32Srcc 
ha29S32rs:c 

dtfoaisrcc 
r«2iasr<:<= 



CGAilO 
aaA2l.753rcc 
aal59297rcc: 
aa77a22a5'cc 
h02353rcc 

ra2ia3rcc 



2151

AAGTd . ATAT 
W^CTG ATAT 
AACTG ATAT 
AACT3 ATAT 
AACTG . ATAT 
AAGTG . ATAT 
XXCrCC , ATAX 
A^GTG . ATAT 
AAGTG.ATAT 
AACTGGATAN 

2201 

CCCT, AAAAA 
CCCT AAAAA 
CCCT AAAAA 
CCCT AAAAA 
CCCT * JO ftA ft 
CCCT , VAAAA 
CCCT. AAAAA 
■ CCCT . AAJLiA 
CCCTRAAAAA 
CCCCT2XAAAA 

2251

ACCCATATTT 
ACCCATATTT 
ACCCATATTT 

p^eccATA Trr 

ACCCArATTT 
ACCCATATTT 
ACCCATATTT 
ACCCATATTT 
ACCCATATTT 
ACCCATATTT 

2301

CTCTTCTCAT 
CTCTTCTCAT 
CTCTTCTCAT 
CTCTTCTCAT 
CTCTTCTCAT 
CTCTTCTCAT 
CTCTTCTCAT 
CTCTTCTCAT 
CTCTTCTCAT 
CTC^TTCTCAT 

T TTGT G ATGC 
TTTGTCATGC 
TTTGTGATGC 
TTTGTCATGC 
-TTGTGATCC 
r-rTGTGATGC 
TTTGTGATGC 
TTTGTGATGC 
TTTGTGATCC 
'TTTGTGATGC 
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TAC.TATAAC 
TAC TATAAC 
TAC TATAAC 
TAC TATAAC 
TAC.TATAAC 
TAC, TATAAC 
TAC.TATAAC 
TACCTATAAC 
TAC.TATAAC 
TACCTATAAC 



ACArrrCGrSA 

ACA?rr^ccA.\ 

ACATTTCGAA 
ACATTTGGAA 
AjCATTTCCAA 
AjCATTTGCAA 
ACATTTGGAA 
ACATTTGGAA 
ASATKTGGSA 
ACATTTGCJTA 



ACTTTACC . A 
ACTTTACC A 
ACTTTACC A 
ACTTTACC A 
ACTTTACC A 
ACTTTACC. A 
ACTTTACC. A 
ACTTTACC. A 
ACTTTACSSA 
ACTTTACC -A 



ATTTTCTTCT 
ATTTTCrrCT 
ATTTTCTTCT 
ATTTTCTTCT 
ATTTTCTTCT 
TTCTTCT 




ATC.TAAGCA 
ATC TAAGCA 
ATC TAAGCA 
ATC TAAGCA 
ATC.TAAGCA 
ATC.TAAGCA 
ATC.TAAGCA 
ATCCTAAGCA 
ATC.TAAGCA 
ATCCTAAGCA 



TATATOCTAT 
TATATGCTAT 
TATATGCTAT 
TATATGCTAT 
TATATGCTAT 
TATATGCTAT 
TATATGCTAT 
TATATGCTAT 
TATATGCTAT 
TATATGCTAT 



AATATATTTC 
AATATATTTC 
AATATATTTC 
AATATATTTC 
AATATATTTC 
AATATATTTC 
AATATATTTC 
AATATATTTC 
AATATATTTC 
AATATATTTC 



TtGATGAAGA 
TtGATGAAGA 
TTGATGAAGA 
TTGATGAAGA 
TTGATGAAGA 
TTGATGAAGA 
TTGATGAAGA 
TTGATGAAGA 
TTGATGAAGA 
TTGATGAAGA 



TCATCT . . CA 
TCATCT GA 
TCATCT GA 
TCATCT GA 
TCATCT. ,CA 
TCATCT. .GA 
TCATCT. -GA 
TCATCT . - GA 
TCATCT. -GA 
TCATCCTGWA 



CTATAOATTC 
CTATAGATTC 
CTATACATTC 
CTATACATTC 
CTATAflATTC 
CTATAGATTC 
CTATAGATTC 
CTATAGATTC 
CTATAGAKTC 
CTATAGATTC 



TCCTCACTGC 
TCCTCACTOC 
TCCTCACTGC 
TCCTCACTGC 
TCCTCACTGC 
TCCTCACTGC 
TCCTCACTGC 
TCCTCACTGC 
TCCTCACTGC 
TCCTCACTGC 



TATTTTTCAC 
TAXTTTTCAC 
TATTTTTCAC 
TATTTTTCAC 
TT 



TATT 
TATTTTTCAC 
TATTTTTCAC 
TATTTTtCAC 
TATTTTTCAC 
TATTTTTCAC 



CCTCTTGGTT 
CCTCTTGGTT 
CCTCTTGGTT 
CCTCTTGGTT 
CCTCTTGGTT 
CCTCTTGGTT 
CCTCTTGGTT 
CCTCTTGGTT 
CCTCTTGGTT. 
CCTCTTGGtr: 



TTCATA-CTTT AAAATCTGTC 
TTGATACTTT AAAATCTGTG 
TTGATACTTT A->AATCTGTG 
TTGATACTT A^AAATCTGTG 
TTGATACTTT AAAATCTGTG 
TTGATACTTT AAAATCTGTG 
TTGAT.ACTTT AVAATCTGTC 
TTGATACTTT AAAATCTGTG 
TTCAT.VCTTT ;^AAATCTCTC 
TTGATACTTT AAAATCTGTG 



2200
TTTQ^ATATT 
TTTG ATATT 
TTTG ATATT 
TTTG AXATT 
Trrc . ATATT 
TTTG.ATATT 
TTTG . ATATT 
TTTG.AXAHT 
TTTG.ATATY 
TTTGS^A^IAirr 

2250
AflTATCTACT 
AGTATCTACT 
AGTATCTACX 
AGTATCTACT 
AGTATCTACT 
AGTATCTACT 
AGTATCTACT 
AGTATCTACT 
AGTATCTACT 
AGTATCTACT 

2300 
ATAAGGACTA 
ATAAGOACTA 
ATAAGGACTA 
ATAAGGACTA 
ATAAGGACTA 
ATAWSACTA 
ATAAGGACTA 
ATAAGGACTA 
ATAAGGACTA 
ATAAGGACTA 

2350
CAAAGTTTAT 
CAAAGTTTAT 
CAAAGTTTAT 
CAAAGTTTAT 
CAAAGTTTAT 
CAAAGTTTAT 
CAAAGTTTAT 
CAAAGTTTAT 
CAAAGTTTAT 
CAAAGTTTAT 

2400 
CCACCCGTTC 
GCACCCGTTC 
GCACCCGTTC 
GCACCCGTTC 
GCACCCGTTC 
GCACCCGTTC 
GC3.CCCGTTC 
CCACCCGTTC 
GCACCCGTTC 
GCACCCGTTC 



FIGURE 32I
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Seq 
OGAIIO 

aal55237rcc 
rS2l3Srcc 



2401

TACAXGAATT 
TJiCATCAATT 
TACATGAATT 
TACATGAATt 
TACATGAATT 
TACATGAATT 
TACATGAATT 
TACAXOAATT 
TACATGAATT 
TACATGffXATT 



ATCAATACTT 
ATCAATATTT 
ATCAATATTT 
ATCAATATTT 
ATCAATATTT 
ATCAATATTT 
ATCAATAm 
ATCAATATTT 
ATCAATATTT 
ATCAATATTT 



CCrAAATTCA 
•CCTAAATTCA 
GCTAAATTCA 
GGTAA TTCA. 
GCTAAATTCA 
GGTAAATTCA 
GCTAAATTCA 
GGTAAATTCX 
GGTAAAXTCA 
GGTAPATTCA 



ATCTSTArrr 

ATCTCTAITT 
ATCTGTATTT 

ATCTCTArrr 
ATCrCTATTT 
ATCTCXATTT 
ATCTGTATTT 
ATCTGTATTT 
ATCTGTATTT 
ATCTGTATTT 



2450 

Gj^i^GTTAA 
GTTTTGT TAA 
OTTrTGTTAA 
CTTTCGXAAA 
GTTTTGTTAA 
GrrTTGTTAA 
GTTTTGTTAA 
GTTTTGTTAA 
GTTTTGTTAA 
GTTTTGTXAA 



pCAiO 1 
(2GA1.10 

aa43i753rcc 
aalS3337rc:c 
aaTTQzzarcc 
h023S3r=c 

r5213Srcc 



2^31 2433 

ACT'CAAAAAT CTCATTTTCC , 

AGTCAAAAAT CTCATTTTCC AGTCGACCCC GCCCC 
ACTCAAAAAT CTCATTTTCC AAAAAAAAAA AAA-^AAAUCT CCAC 
ATCCAAAAAT2CJirCATT ■ 

ACTCAAAAAT CTCATTTTCC AAAA 

a^CAAAAAT CTCArmCC 

ACTCAAAAAT CTCATTT 

AGTCAAAAAW iTTCAA-WTCC 

AGTOtAAAAT CTCATTTTCC 

ACTT^A^TNANT CTCATTTTCC AA2rA::GGCGG GCGCGGGGGA AGTTCCTG 
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